category of speech 中文意思是什麼
category of speech
解釋
言語范疇-
Aiming at this question, the paper describes an approach to correcting the part - of - speech tagging of multi - category words automatically
針對這一難點問題,本文提出了一種兼類詞詞性標注的自動校對方法。 -
The disambiguation of multi - category words is one of the difficulties in part - of - speech tagging of chinese text, which affects the processing quality of corpora greatly
摘要兼類詞的詞類排歧是漢語語料詞性標注中的難點問題,它嚴重影響語料的詞性標注質量。 -
According to the results of close - test and open - test on the corpus of 500, 000 chinese characters, the accuracy of multi - category words ' part - of - speech tagging can be increased by 11. 32 % and 5. 97 % respectively
分別對50萬漢語語料做封閉測試和開放測試,結果顯示,校對后語料的兼類詞詞性標注正確率分別可提高11 . 32 %和5 . 97 % 。 -
Basing on it we bring forward the disambiguation strategy using rule techniques and statistics techniques. in rule model, the acqusition method of rules base is improved. we use the part - of - speech of syntactic category to replace the syntactic category. in addition, statistics method is used to help to construct the rule base. in statistics model, the concept of learning machine - made is presented. in according to the result of learning, the method of calculating transition probabilities and symbol probabilities are amended
在規則方法中,改進了規則庫的構建方法,用兼類詞詞性代替兼類詞本身,並嘗試使用統計輔助構建規則庫;在統計方法中,在二元語法模型基礎上引入了學習機制的概念,根據學習結果對詞性概率和詞匯概率的獲取方法進行了修正。 -
It acquires correction rules for the part - of - speech tagging of multi - category words from right - tagged corpora based on the rough sets and data mining, and then corrects the corpora based on these rules automatically
它利用數據挖掘的方法從正確標注的訓練語料中挖掘獲取有效信息,自動生成兼類詞詞性校對規則,並應用獲取的規則實現對機器初始標注語料的自動校對,從而提高語料中兼類詞的詞性標注質量。
分享友人