Download Area
Here you can find benchmark collections I have used in my research, as well as software I have developed. Don't forget to come back again. The page is updated often.
- Benchmark Data Sets
-
• Information Retrieval
CACM
CRANFIELD
NPL
MEDLINE
TIMES
• Word Sense Disambiguation (WORDNET 2.0 disambiguation)
Senseval 2
Senseval 3
- NLP Tools and Other Resources
-
• NLP Tools
Porter Stemmer in Java
Link to Stanford Log-Linear POS Tagger
• Other Resources
Stopword list