Other Student Works
Document Type
Article
Publication Date
Spring 2-19-2020
Abstract
Parts of speech (POS) tagging is the process of assigning a word in a text as corresponding to a part of speech based on its definition and its relationship with adjacent and related words in a phrase, sentence, or paragraph. POS tagging falls into two distinctive groups: rule-based and stochastic. In this paper, a rule-based POS tagger is developed for the English language using Lex and Yacc. The tagger utilizes a small set of simple rules along with a small dictionary to generate sequences of tokens.
Recommended Citation
Pham, B. (2020). Parts of Speech Tagging: Rule-Based. Retrieved from https://digitalcommons.harrisburgu.edu/cisc_student-coursework/2
Start Page No.
1
End Page No.
6
Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 International License.
Comments
An article written on a rule-based part of speech tagger implemented in C through Flex and Yacc.