NaCTeM

Selected Publications

2009

  • Ananiadou, S. (2009)
    Text Mining for Biomedicine. In Violaine Prince and Mathieu Roche (Eds.), Information Retrieval in Biomedicine: Natural Language Processing for Knowledge Integration, IGI Global. In press
  • Ananiadou, S., Weissenbacher, D., Rea, B., Pieri, E., Lin, Y., Vis, F., Procter, R. & Halfpenny, P. (2009)
    Supporting Frame Analysis using Text Mining. In Proceedings of the 5th International Conference on e-Social Science. (to appear)
  • Ananiadou, S., Okazaki, N., Procter, R., Rea, B., & Thomas, J. (2009)
    Supporting Systematic Reviews using Text Mining. In Social Science Computer Review, special issue.
  • Kano, Y., Baumgartner Jr., W.A., McCrohon, L., Ananiadou, S., Cohen, K.B., Hunter, L. & Tsujii, J. (2009)
    U-Compare: share and compare text mining tools with UIMA. Bioinformatics.
  • Kano, Y., McCrohon, L., Ananiadou, S. & Tsujii, J. (2009)
    Integrated NLP Evaluation System for Pluggable Evaluation Metrics with Extensive Interoperable Toolkit. In Proceedings of NAACL HLT 2009 (to appear).
  • Piao, S., Tsuruoka, Y. & Ananiadou, S. (2009)
    HYSEAS: A HYbrid SEntiment Analysis System. In Proceedings of the Fourth International Conference on Interdisciplinary Social Sciences. (to appear)
  • Sasaki, Y., Thompson, P., McNaught, J. & Ananiadou, S. (2009)
    Three BioNLP Tools Powered by the BioLexicon. In Proceedings of EACL 2009 Demonstration Session, pp. 61-64
  • Sierra, G.E., Cedeño, A.B. & Ananiadou, S. (2009)
    An Improved Automatic Term Recognition method for Spanish. In Proceedings of the 10th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2009), pp. 125-136
  • Tsuruoka, Y., Tsujii, J. & Ananiadou, S. (2009)
    Stochastic Gradient Descent Training for L1-regularized Log-linear Models with Cumulative Penalty. ACL-IJCNLP 2009. (to appear)
  • Tsuruoka, Y., Tsujii, J. & Ananiadou, S. (2009)
    Fast Full Parsing by Linear-Chain Conditional Random Fields. In Proceedings of the 12th Conference of European Chapter of the Association for Computational Linguistics (EACL-09), pp. 790-798
  • Venturi, G., Montemagni, S., Marchi, S., Sasaki, Y., Thompson, P., McNaught, J. & Ananiadou, S. (2009)
    Bootstrapping a Verb Lexicon for Biomedical Information Extraction. In Proceedings of the 10th International Conference on Intelligent Text Processing and Computational Linguistics (CICLing 2009), pp. 137-148.

2008

2007

2006

  • Ananiadou, S. & McNaught, J. (2006)
    Text Mining for Biology and Biomedicine. Artech House Books.
  • Ananiadou, S. & McNaught, J. (2006)
    Introduction to Text Mining in Biology. In Ananiadou, S. & McNaught, J. (Eds) Text Mining for Biology and Biomedicine, pp. 1-12, Artech House Books.
  • Ananiadou, S. & Nenadic, G. (2006)
    Automatic Terminology Management in Biomedicine. In Ananiadou, S. & McNaught, J. (Eds) Text Mining for Biology and Biomedicine, pp. 67-98, Artech House Books.
  • Kim, J.D., Ananiadou, S. and Tsujii, J. (2006)
    Ontology based Semantic Annotation for Knowledge Extraction, in 2nd International Digital Curation Conference, Glasgow
  • Ananiadou, S. and Fluck, J. (eds.) (2006)
    Second International Symposium on Semantic Mining in Biomedicine (SMBM), BMC Bioinformatics 2006, 7 (Suppl 3): S1 (forthcoming, November 2006)
  • Ananiadou, S., Kell, D.B. and Tsujii, J. (2006)
    Text Mining and its Potential Applications in Systems Biology, in Trends in Biotechnology (TIBTECH) (accepted)
  • Chun, Hong-woo, Yoshimasa Tsuruoka, Jin-Dong Kim, Rie Shiba, Naoki Nagata, Teruyoshi Hishiki, and Jun'ichi Tsujii. Extraction of Gene-Disease Relations from MedLine using Domain Dictionaries and Machine Learning. The Pacific Symposium on Biocomputing (PSB), Maui, Hawaii, USA, pp. 4-15, January 2006.
    [PDF]
  • Chun, Hong-woo, Yoshimasa Tsuruoka, Jin-Dong Kim, Rie Shiba, Naoki Nagata, Teruyoshi Hishiki, and Jun'ichi Tsujii. Automatic Recognition of Topic-Classified Relations between Prostate Cancer and Genes from Medline Abstracts. In the Proceedings of the Second International Symposium on Semantic Mining in Biomedicine. Jena, Germany, pp. 5-12, April 2006.
    [PDF]
  • Okanohara, Daisuke, Yusuke Miyao, Yoshimasa Tsuruoka and Jun'ichi Tsujii. Improving the Scalability of Semi-Markov Conditional Random Fields for Named Entity Recognition. In the Proceedings of ACL 2006. Sydney, Australia, July 2006.
    [PDF]
  • Kim, Jin-Dong and Jun'ichi Tsujii. Corpora and their Annotation. In Sophia Ananiadou and John McNaught (Eds.), Text Mining for Biology and Biomedicine. 46 Gillingham Street, London SW1V 1AH, UK, Artech House, 2006.
    [An introduction at the publisher's homepage]
  • Mima, H., Ananiadou, S. & Katsushima, M. (2006)
    Terminology-based Knowledge Mining for New Knowledge Discovery, in ACM Transactions on Asian language information processing (TALIP) Special Issue on Text Mining and Management in Biomedicine, Vol. 5, No. 1, March 2006, 74-88
  • Miyao, Yusuke, Tomoko Ohta, Katsuya Masuda, Yoshimasa Tsuruoka, Kazuhiro Yoshida, Takashi Ninomiya and Jun'ichi Tsujii. Semantic Retrieval for the Accurate Identification of Relational Concepts in Massive Textbases. In the Proceedings of COLING-ACL 2006. Sydney, Australia, pp. 1017--1024, July 2006.
    [PDF]
  • Nenadic, G. & Ananiadou, S. (2006)
    Mining Term Associations from Biomedical Literature to Support Knowledge Discovery, in ACM Transactions on Asian language information processing (TALIP) Special Issue on Text Mining and Management in Biomedicine, Vol. 5, No. 1, March 2006, 1-22 [pdf]
  • Ninomiya, Takashi, Takuya Matsuzaki, Yoshimasa Tsuruoka, Yusuke Miyao and Jun'ichi Tsujii. Extremely Lexicalized Models for Accurate and Fast HPSG Parsing. In the Proc. of EMNLP 2006. Sydney, Australia, pp. 155--163, July 2006.
    [PDF]
  • Ohta, Tomoko, Yuka Tateisi, Jin-Dong Kim, Akane Yakushiji and Jun'ichi Tsujii. Linguistic and Biological Annotations of Biological Interaction Events. In the Proceedings of The Fifth International Conference on Language Resource and Evaluation (LREC 2006). Genoa, Italy, pp. 1405--1408, May 2006.
  • Ohta, Tomoko, Yusuke Miyao, Takashi Ninomiya, Yoshimasa Tsuruoka, Akane Yakushiji, Katsuya Masuda, Jumpei Takeuchi, Kazuhiro Yoshida, Tadayoshi Hara, Jin-Dong Kim, Yuka Tateisi and Jun'ichi Tsujii. An Intelligent Search Engine and GUI-based Efficient MEDLINE Search Tool Based on Deep Syntactic Parsing. In the Proceedings of the COLING/ACL 2006 Interactive Presentation Sessions. Sydney, Australia, pp. 17--20, July 2006.
    [PDF]
  • Okazaki, N. and Ananiadou, S. (2006)
    Clustering acronyms in biomedical text for disambiguation, in Proceedings of LREC.
  • Okazaki, N. and Ananiadou, S. (2006)
    Building an Abbreviation Dictionary using a Term Recognition Approach, in Bioinformatics (accepted)
  • Rebholz-Schuhmann, D., Kirsch, H., Nenadic, G.: IeXML: towards an annotation framework for biomedical semantic types enabling interoperability of text processing modules, Proceedings of Joint BioLINK and Bio-Ontologies SIG Meeting, ISMB 2006, Fortaleza, Brazil
  • Unno, Yuya, Takashi Ninomiya, Yusuke Miyao and Jun'ichi Tsujii. Trimming CFG Parse Trees for Sentence Compression Using Machine Learning Approaches. In the Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions. Sydney, Australia, pp. 850--857, Association for Computational Linguistics, July 2006.
    [PDF][poster]
  • Yakushiji, Akane, Miyao, Yusuke, Ohta, Tomoko and Tateisi, Yuka and Tsujii, Jun'ichi. Automatic Construction of Predicate-argument Structure Patterns for Biomedical Information Extraction. In the Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing. Sydney, Australia, pp. 284--292, July 2006.
    [PDF]

2005

  • Ananiadou, S., Chruszcz, J., Keane, J., McNaught, J. & Watry, P. (2005) The National Centre for Text Mining: Aims and Objectives. In Ariadne, Issue 42, January 2005. [url]
  • Chun, Hong-Woo, Young-Sook Hwang, Hae-Chang Rim. Unsupervised Event Extraction from Biomedical Literature using Co-occurrence Information and Basic Patterns. In the Lecture Notes in Artificial Intelligence. pp. 777-786, Springer, 2005. [PDF]
  • Hara, Tadayoshi, Yusuke Miyao and Jun'ichi Tsujii. Adapting a probabilistic disambiguation model of an HPSG parser to a new domain. In Robert Dale, Kam-Fai Wong, Jian Su and Oi Yee Kwong (Eds.), Natural Language Processing – IJCNLP 2005. Lecture Notes in Artificial Intelligence 3651. Jeju Island, Korea, pp. 199--210, Springer-Verlag, October 2005. ISSN 0302-9743. [PS][PDF]
  • Kazama, Jun'ichi and Jun'ichi Tsujii. Maximum Entropy Models with Inequality Constraints: A case study on text categorization. Machine Learning Journal special issue on Learning in Speech and Language Technologies. 60(1-3). pp. 169-194, Springer SBM, September 2005. [Machine Learning]
  • Matsuzaki, Takuya, Yusuke Miyao and Jun'ichi Tsujii. Probabilistic CFG with Latent Annotations. In the Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics. Michigan, USA, pp. 75-82, June 2005. [PDF]
  • Miyao, Yusuke and Jun'ichi Tsujii. Probabilistic disambiguation models for wide-coverage HPSG parsing. In the Proceedings of ACL 2005. Ann Arbor, Michigan, pp. 83-90, June 2005. [PDF]
  • Ninomiya, Takashi, Yoshimasa Tsuruoka, Yusuke Miyao and Jun'ichi Tsujii. Efficacy of Beam Thresholding, Unification Filtering and Hybrid Parsing in Probabilistic HPSG Parsing. In the Proc. of IWPT 2005. Vancouver, BC, Canada, pp. 103--114, October 2005. [PS][PDF]
  • Ninomiya, Takashi, Yusuke Miyao and Jun'ichi Tsujii. A Persistent Feature-Object Database for Intelligent Text Archive Systems. In Keh-Yih Su, Jun'ichi Tsujii, Jong-Hyeok Lee and Oi Yee Kwong (Eds.), Natural Language Processing - IJCNLP 2004. LNAI 3248. Hainan Island, China, pp. 197--205, Springer-Verlag, 2005. ISSN 0302-9743. [PS][PDF]
  • Nenadic, G., Spasic, I. & Ananiadou, S. (2005) Mining Biomedical Abstracts: What’s in a Term? In Keh-Yih Su, Jun’ichi Tsujii, Jong-Hyeok Lee, et al. (Eds.) Natural Language Processing – IJCNLP 2004 First International Joint Conference, Lecture Notes in Computer Science vol. 3248 [pdf]
  • Okanohara, Daisuke and Jun'ichi Tsujii. Assigning Polarity Scores to Reviews Using Machine Learning Techniques. In Robert Dale, Kam-Fai Wong, Jian Su and Oi Yee Kwong (Eds.), Natural Language Processing - IJCNLP 2005. LNCS 3651. Jeju Island, Korea, Springer-Verlag, October 2005. [PDF]
  • Rice, S., Nenadic, G., Stapley, B.: Mining protein function from text using term-based support vector machines, BMC Bioinformatics 2005, 6(Suppl 1):S22, "Critical assessment of text mining methods in molecular biology" (full text)
  • Spasic, I., Ananiadou, S., McNaught, J. & Kumar, A. (2005) Text Mining and Ontologies in Biomedicine: Making Sense of Raw Text. Briefings in Bioinformatics 6(3), September 2005. [pdf]
  • Spasic, Irena, Sophia Ananiadou and Junichi Tsujii. (2005) MaSTerClass: a case-based reasoning system for the classification of biomedical terms, in Bioinformatics 21(11), pp. 2748-2758. [pdf]
  • Spasic, I. & Ananiadou, S. (2005) A Flexible Measure of Contextual Similarity for Biomedical Terms, in Proceedings of Pacific Symposium on Biocomputing (PSB 2005), Hawaii, USA. [pdf]
  • Tsuruoka, Y., Tateishi, Y., Kim, J-D., Ohta, T., McNaught, J., Ananiadou, S. and Tsujii, J. (2005) Developing a Robust Part-of-Speech Tagger for Biomedical Text, LNCS, Springer, pp. 382-392 [pdf]
  • Tsujii, J. & Ananiadou, S. (2005) Thesaurus or logical ontology, which one do we need for text mining? In Language Resources and Evaluation, Springer Science and Business Media B.V., vol. 39, no 1, 77-90. [pdf]
  • Tsuruoka, Y., Ananiadou, S. & Tsujii, J. (2005) A Machine Learning Approach to Acronym Generation, BioLink 2005. [pdf]
  • Tsuruoka, Yoshimasa, Yuka Tateishi, Jin-Dong Kim, Tomoko Ohta, John McNaught, Sophia Ananiadou and Jun'ichi Tsujii. Developing a Robust Part-of-Speech Tagger for Biomedical Text. In the Advances in Informatics - 10th Panhellenic Conference on Informatics. LNCS 3746. Volos, Greece, pp. 382--392, November 2005. ISSN 0302-9743. [PDF]
  • Tsuruoka, Yoshimasa and Jun'ichi Tsujii. Iterative CKY Parsing for Probabilistic Context-Free Grammars. In Keh-Yih Su, Jun'ichi Tsujii, Jong-Hyeok Lee and Oi Yee Kwong (Eds.), Natural Language Processing - IJCNLP 2004. LNAI 3248. Hainan Island, China, pp. 52-60, Springer-Verlag, 2005. ISSN 0302-9743. [PDF]
  • Tsuruoka, Yoshimasa and Jun'ichi Tsujii. Bidirectional Inference with the Easiest-First Strategy for Tagging Sequence Data. In the Proceedings of HLT/EMNLP 2005. Vancouver, BC, Canada, pp. 467-474, October 2005. [PDF]
  • Tsuruoka, Yoshimasa and Jun'ichi Tsujii. Chunk Parsing Revisited. In the Proceedings of the 9th International Workshop on Parsing Technologies (IWPT 2005). Vancouver, BC, Canada, pp. 133-140, October 2005. [PDF]
  • Yakushiji, Akane, Yusuke Miyao, Yuka Tateisi and Jun'ichi Tsujii. Biomedical Information Extraction with Predicate-Argument Structure Patterns. In the Proceedings of the First International Symposium on Semantic Mining in Biomedicine. Hinxton, Cambridgeshire, UK, pp. 60--69, 2005. [PDF]

2004

  • Ananiadou, S., Friedman, C. & Tsujii, J. (2004) Introduction to Named Entity Recognition in Biomedicine, editorial, Special Issue, Journal of Biomedical Informatics, vol. 37 (6), 393-395
  • Chun, Hong-Woo, Tomoko Ohta, Jin-Dong Kim and Jun'ichi Tsujii. Building Patterns for Biomedical Event Extraction. The 15th International Conference on Genome Informatics (GIW), pp. 163--164, December 2004. [PDF]
  • Kim, Jin-Dong, Tomoko Ohta, Yoshimasa Tsuruoka, Yuka Tateisi and Nigel Collier. Introduction to the Bio-Entity Recognition Task at JNLPBA. In the Proceedings of the International Workshop on Natural Language Processing in Biomedicine and its Applications (JNLPBA-04). pp. 70--75, 2004. [PDF]
  • Krauthammer, M., Nenadic, G.: Term Identification in the Biomedical Literature, Journal of Biomedical Informatics (Special Issue on Named Entity Recognition in Biomedicine), Vol. 37(6):512-526, 2004 (pre-print)
  • Miyao, Yusuke and Jun'ichi Tsujii. Deep Linguistic Analysis for the Accurate Identification of Predicate-Argument Relations. In the Proceedings of COLING 2004. Geneva, Switzerland, pp. 1392-1397, 2004. [PDF]
  • Nakanishi, Hiroko, Yusuke Miyao and Jun'ichi Tsujii. An Empirical Investigation of the Effect of Lexical Rules on Parsing with a Treebank Grammar. In the Proceedings of the third TLT2004. Tubingen, Germany, pp. 103--114, 2004. [PDF]
  • Nenadic, G., Ananiadou, S. & McNaught, J. (2004) Enhancing Automatic Term Recognition through Term Variation, in Proceedings of 20th Int. Conference on Computational Linguistics, Coling 2004, Geneva, Switzerland. [pdf]
  • Spasic, I. & Ananiadou, S. (2004) Using Automatically Learnt Verb Selectional Preferences for Classification of Biomedical Terms, in Journal of Biomedical Informatics, vol. 37 (6), 483-497. [pdf]
  • Nenadic, G., Spasic, I., Ananiadou, S. (2004) Automatic Discovery of Term Similarities Using Pattern Mining, in International Journal of Terminology. 10:1, 55-80 [pdf]
  • Spasic, I., Nenadic, G. & Ananiadou, S. (2004) Learning to Classify Biomedical Terms through Literature Mining and Genetic Algorithms. In Zheng Rong Yang, Richard Everson & Hujun Yin (Eds.) Intelligent Data Engineering and Automated Learning – IDEAL 2004, LNCS 3177: 345-351, Springer-Verlag. [pdf]
  • Tateisi, Yuka and Jun'ichi Tsujii. Part-of-Speech Annotation of Biology Research Abstracts. In the Proceedings of 4th International Conference on Language Resource and Evaluation (LREC2004). IV. Lisbon, Portugal, pp. 1267-1270, May 2004. [PDF]
  • Tsuruoka, Yoshimasa and Jun'ichi Tsujii. Improving the Performance of Dictionary-based Approaches in Protein Name Recognition. Journal of Biomedical Informatics. 37(6). pp. 461-470, Elsevier, 2004. [link]
  • Tsuruoka, Yoshimasa, Yusuke Miyao and Jun'ichi Tsujii. Towards efficient probabilistic HPSG parsing: integrating semantic and syntactic preference to guide the parsing. In the Proceedings of IJCNLP-04 Workshop: Beyond shallow analyses - Formalisms and statistical modeling for deep analyses. Hainan Island, China, 2004. [PDF]