bioinformatics chat

#8 Perfect k-mer hashing in Sailfish

August 5, 2017

The original version of Sailfish, an RNA-Seq quantification tool, used minimal perfect hash functions to replace k-mers with unique integers. (The current version appears to use a cuckoo hash map instead.)

This is my attempt to explain how a minimal perfect hash function could be built. The algorithm described here is not exactly the same as the one Sailfish used, but it follows the same idea.
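
As a rough illustration of the idea, here is a minimal Python sketch of one common way to build a minimal perfect hash function, a simple "hash and displace" scheme. It is not the algorithm Sailfish used, and not necessarily the one discussed in the episode. The names `kmers_of`, `seeded_hash`, `build_mphf`, and `lookup` are made up for this example, BLAKE2b is used only because it is a convenient deterministic seeded hash, and the bucket count of about n/4 is an arbitrary assumption. The sketch assumes a non-empty set of k-mers.

```python
import hashlib


def kmers_of(seq, k):
    """Return all overlapping k-mers of a sequence (hypothetical helper)."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


def seeded_hash(kmer, seed):
    """Deterministic 64-bit hash of a k-mer under an integer seed."""
    digest = hashlib.blake2b(
        kmer.encode("ascii"),
        digest_size=8,
        salt=seed.to_bytes(8, "little"),
    ).digest()
    return int.from_bytes(digest, "little")


def build_mphf(kmers):
    """Build a seed table mapping n distinct k-mers onto 0..n-1 without collisions."""
    keys = sorted(set(kmers))
    n = len(keys)
    num_buckets = max(1, n // 4)  # assumption: about 4 keys per bucket

    # First-level hash (seed 0) assigns every key to a bucket.
    buckets = [[] for _ in range(num_buckets)]
    for key in keys:
        buckets[seeded_hash(key, 0) % num_buckets].append(key)

    seeds = [0] * num_buckets
    occupied = [False] * n

    # Place the largest buckets first, while most slots are still free.
    for b in sorted(range(num_buckets), key=lambda i: -len(buckets[i])):
        bucket = buckets[b]
        if not bucket:
            continue
        seed = 1
        while True:
            slots = [seeded_hash(key, seed) % n for key in bucket]
            no_internal_clash = len(set(slots)) == len(slots)
            if no_internal_clash and not any(occupied[s] for s in slots):
                break
            seed += 1  # try the next seed until the whole bucket fits
        seeds[b] = seed
        for s in slots:
            occupied[s] = True
    return seeds, n


def lookup(kmer, seeds, n):
    """Return the integer assigned to a k-mer that was in the build set."""
    seed = seeds[seeded_hash(kmer, 0) % len(seeds)]
    return seeded_hash(kmer, seed) % n


kmers = kmers_of("ACGTACGTTGCAAGCTTAGGCT", 5)
seeds, n = build_mphf(kmers)
for kmer in sorted(set(kmers)):
    print(kmer, lookup(kmer, seeds, n))
```

Only the small seed table has to be stored, not the k-mers themselves. The price is that a k-mer outside the build set still gets mapped to some integer in the range, so a function like this cannot tell you whether a k-mer was present.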


Music: Eric Skiff — Come and Find Me (modified, licensed under CC BY 4.0).