From SWRC

Introduction

Hannanum is a morphological analyzer and a POS tagger which is plug-in component architecture-based tool. Hannanum was developed in Java, so this tool is available in every platform which has JRE.
(1999 ~ )

When you analyze natural language text, a word may have different part-of-speech depending on the context. Part-of-speech tagging is to solve this ambiguity. After POS tagging process, we can know the role of the word and the structure of the sentence. Korean morphological analyzer is software that has Korean text as an input, and separates them with morphemes. The morphological analysis and POS tagging result is basic and important for natural language processing.

In each natural language, POS tagging methods need to be different regarding the characteristics of the target language. In the case of English that is an inflectional language it is easier to get morphemes in a sentence by segmenting the word on blanks, but in the case of Korean that is an inflectional language more than one morpheme can be connected and affect each other. So combination of the ambiguity of morpheme detection and part-of-speech tagging makes it more complex to analyze Korean text.

Architecture of HanNanum (java version)

The Java version of HanNanum Morphological Analyzer adapted plug-in component architecture for more flexible use. Users are allowed to set the work flow up for own purpose using the plug-ins already developed, and developers can implement new plug-ins easily using existing system and resources.

Features of HanNanum

Platform Independent: HanNanum can be run on any system with JRE 1.6 or above.

The resources are opened so users are allowed to edit and use them under the license.

Flexible Architecture: Easy to add a new functionality by implementing just a plug-in.

Easy to use: Add the library jhannanum.jar to your project, then you are ready to use.

Support Multi-thread and Single-thread mode.

Based on UTF-8 that has higher compatibility than EUC-KR

Workflow Examples

Followings are the examples of HanNanum work flows that analyze Korean text for own different purpose. You can easily test these examples using the example programs on kr.ac.kaist.swrc.jhannanum.demo.* or GUIDemo in the HanNanum release. To download HanNanum, visit KLDP Download (KO) or SourceForge Download (EN).