Frequently Asked Questions
Which tagger should I use?
A part-of-speech tagger consists of an algorithm (e.g., Viterbi algorithm) on top of a model (e.g., Hidden Markov Model) typically constructed from a corpus (e.g., Penn-Treebank) and algorithm. Thus, these features determine the applicability and performance of a tagger. Currently, TerMine supports GENIA tagger (http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/postagger/) and TreeTagger (http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/). Trained on the GENIA, PennBioIE, and Wall Street Journal corpora, GENIA tagger is suitable for processing bio-medical text. TreeTagger is suitable for a general English text such as newspaper articles.
BackFeatured News
- NaCTeM success at EMNLP 2025 - 7/7 papers accepted
- 1st Workshop on Misinformation Detection in the Era of LLMs - Presentation slides now available
- Prof. Ananiadou appointed Deputy Director of the Christabel Pankhurst Institute
- ELLIS Workshop on Misinformation Detection - Presentation slides now available
- Prof. Sophia Ananiadou accepted as an ELLIS fellow
- BioNLP 2025 and Shared Tasks accepted for co-location at ACL 2025
- Prof. Junichi Tsujii honoured as Person of Cultural Merit in Japan
Other News & Events
- AI for Research: How Can AI Disrupt the Research Process?
- CL4Health @ NAACL 2025 - Extended submission deadline - 04/02/2025
- Invited talk at the 15th Marbach Castle Drug-Drug Interaction Workshop
- Participation in panel at Cyber Greece 2024 Conference, Athens
- Shared Task on Financial Misinformation Detection at FinNLP-FNP-LLMFinLegal








