Abbreviation Extraction Project: AbbrevExtractor
n Research Problem
n Understanding abbreviations is a difficult task for humans
n Extracting abbreviation is difficult for machines
n Context information is requires
n Well-known abbreviations are not always defined
n A method to identify senses in documents is required
n Abbreviations are highly ambiguous
n Our Solution
n A hybrid approach combining lexical analysis techniques and the Support Vector Machine (SVM): AKA AbbrevExtractor
n It incorporates lexical analysis techniques to supervised learning for extracting abbreviations
n It makes use of text chunking techniques to identify long forms of abbreviations
n It significantly improves recall compared to other techniques
· AbbrevExtractor for biomedical literature can be found at AbbrevExtractor