Hapax analyses the vocabulary of software systems. I apply search engine technology (Latent Semantic Analysis) to analyse the vocabulary and topics of software. In my research, I used Hapax for
Hapax’s approach is programming language independent as it is based on identifier names and comments only.
The original Hapax is written in Visualworks Smalltalk. Recently, Romain an me started porting Hapax to Squeak, but its not yet done, see