The UNITE database for molecular identification and taxonomic communication of fungi and other eukaryotes: sequences, taxa and classifications reconsidered

Kessy Abarenkov, R Henrik Nilsson, Karl-Henrik Larsson, Andy F S Taylor, Tom W May, Tobias Guldberg Frøslev, Julia Pawlowska, Björn Lindahl, Kadri Põldmaa, Camille Truong, Duong Vu, Tsuyoshi Hosoya, Tuula Niskanen, Timo Piirmann, Filipp Ivanov, Allan Zirk, Marko Peterson, Tanya E Cheeke, Yui Ishigami, Arnold Tobias Jansson, Thomas Stjernegaard Jeppesen, Erik Kristiansson, Vladimir Mikryukov, Joseph T Miller, Ryoko Oono, Francisco J Ossandon, Joana Paupério, Irja Saar, Dmitry Schigel, Ave Suija, Leho Tedersoo, Urmas Kõljalg

Research output: Contribution to journal, Article, peer-review

Abstract

UNITE (https://unite.ut.ee) is a web-based database and sequence management environment for molecular identification of eukaryotes. It targets the nuclear ribosomal internal transcribed spacer (ITS) region and offers nearly 10 million such sequences for reference. These are clustered into ∼2.4M species hypotheses (SHs), each assigned a unique digital object identifier (DOI) to promote unambiguous referencing across studies. UNITE users have contributed over 600 000 third-party sequence annotations, which are shared with a range of databases and other community resources. Recent improvements facilitate the detection of cross-kingdom biological associations and the integration of undescribed groups of organisms into everyday biological pursuits. Serving as a digital twin for eukaryotic biodiversity and communities worldwide, the latest release of UNITE offers improved avenues for biodiversity discovery, precise taxonomic communication and integration of biological knowledge across platforms.

Original language: English
Article number: gkad1039
Pages (from-to): 1-7
Number of pages: 7
Journal: Nucleic Acids Research
Early online date: 11 Nov 2023
Publication status: E-pub ahead of print - 11 Nov 2023

Bibliographical note

Acknowledgements
We acknowledge Marie Zirk for her work in designing the UNITE logotype and creating the visual abstract for this article.

Funding
UNITE database development is financed by the Estonian Research Council [PRG1170]; European Union's Horizon 2020 project BGE [101059492]. The PlutoF digital infrastructure is supported by the European Union's Horizon 2020 project BiCIKL [101007492]; Estonian Research Infrastructure roadmap project DiSSCo Estonia. Funding for open access charge: UNITE Community.

Conflict of interest statement. None declared.

Data Availability Statement

All sequences covered in this article are deposited in the public sequence databases INSDC and UNITE. Custom reference sequence datasets for a range of metabarcoding software pipelines are available for download on the UNITE Resources page (https://unite.ut.ee/repository.php). UNITE SH datasets are published with DataCite DOIs and are accessible through the UNITE (https://unite.ut.ee) and PlutoF (https://plutof.ut.ee) public homepages.
