Skip to content

miRBase 21 finally arrives

Apologies for the longer-than-usual wait.

miRBase 21 is now available on the website, and all data available for download on the FTP site. As usual, the release notes describe the major changes. Of particular note this time, the Genome Reference Consortium have released a new human genome assembly, GRCh38. We have therefore remapped the human microRNA dataset to this assembly, which includes the removal of a handful of duplicate entries that now map to a single locus — for example, GRCh37 had 6 loci representing miR-3118, whereas GRCh38 has only 4. In total, there is a small increase in the number of annotated human microRNA loci, to 1881. Elsewhere in the database, the increases have been larger — we have hundreds of new sequences in each of bat, horse, goat, cobra and salmon, amongst others. In total, 4196 new hairpin sequences and 5441 new mature products have been added. The work to clean up dubious and misannotated sequences also goes on, with another 72 entries in total removed from this release.

Unfortunately, at the last moment, we’ve found an issue with the update of the “high confidence” microRNA dataset. Rather than delay the release further, we’ve decided to go ahead without the “high confidence” set for now. That will follow in the next few days, with an announcement here.

As usual, please let us know (use the comments box below, or by email) if you have any questions or comments.

Posted in releases.

7 Responses

Stay in touch with the conversation, subscribe to the RSS feed for comments on this post.

  1. Pushpendra Gupta says

    I would like to know how many plant species are represented in miRBASE, and how many miRNAs are listed for all plant species taken together.

    • sam says

      You can see numbers of miRNA sequences for each pant species on the browse page, (click “expand all” and then towards the bottom). There are 73 plant species represented, with a total of 7057 miRNA loci between them.

  2. Wendy Wang says

    How many mature miRNAs in human?

  3. Mickael says

    Thanks a lot sam !

  4. Jakob Kofoed Janot says

    Will the dump files database_files/mature_read_count_by_experiment.txt.gz and database_files/mature_read_count.txt.gz be avilable online?

  5. Matth says

    First of all great work!
    Do you happen to know when the high confidence results are updated?

Some HTML is OK

or, reply to this post via trackback.