Seminar - Dr. Ann Copestake
Speaker: Dr. Ann Copestake (University of Cambridge)
Title: Robust Semantic Processing for Information Extraction
Date: 2pm, 13th January, 2006
Location: Lecture Theatre E7, Renold Building (building 8 on the campus map)
Abstract: Natural language processing techniques have different strengths and weaknesses. Shallow processing may be very fast and robust, but extracts limited information. Deep processors can produce detailed semantic representations, but are relatively slow and brittle and require much more knowledge. Various approaches to building combined systems have been tried, for instance so that deep processing is only invoked on regions of text which have been identified as interesting by shallow processors. But different processors typically assume very different representations, which makes it difficult to combine them flexibly.

We have developed a common semantic representation language (Robust Minimal Recursion Semantics: RMRS) for deep and shallow processing. Shallow processors provide a representation which is compatible with deep processing, but relatively underspecified. In the Deep Thought project and subsequent work, we have demonstrated that various systems (including part-of-speech taggers, noun phrase chunkers, named entity recognisers, robust parsers and deep parsers) can be adapted to output RMRSs. Information extraction systems can be built which utilise RMRS markup as a base.

In a current project, SciBorg, we are further developing this approach and applying it to Chemistry. We treat the different processing stages as providing levels of standoff annotation with respect to the scientific text. Our aim is to provide infrastructure that can be used by Chemistry researchers to support a variety of tasks, including enhanced search, information extraction and ontology expansion.