詞類標記 的英文怎麼說
中文拼音 [cílèibiāojì]
詞類標記
英文
pos-tagging- 詞 : 名詞1 (說話或詩歌、文章、戲劇中的語句) speech; statement; lines of play 2 (一種韻文形式 起於唐...
- 類 : Ⅰ名1 (許多相似或相同的事物的綜合; 種類) class; category; kind; type 2 (姓氏) a surname Ⅱ動詞...
- 標 : Ⅰ名詞1 [書面語] (樹梢) treetop; the tip of a tree2 (枝節或表面) symptom; outside appearance; ...
- 記 : Ⅰ動詞1 (把印象保持在腦子里) remember; bear in mind; commit to memory 2 (記錄; 記載;登記) writ...
- 標記 : (標志; 記號; 貨物標記) tab; sign; stamp; peg; label; mark; flag; blip; notation; fleck; track; ...
-
This thesis enumerates four types of perfective aspect markers in bai yu jing : perfective adverb, perfective modal particle, perfective verb and perfective auxiliary verb
摘要《百喻經》是中古時期口語色彩較濃的佛經故事集,其中的完成體標記系統包括4類成員:完成體副詞、完成體語氣詞、完成體動詞和完成體助詞。At first, the text is segmented to words and converted to a sequence of part - of - speech tags ; then based on the pos tags sequence parameters and phrase - break distance information from training, markov model is used to get the most likely phrase break sequence
首先,文本進行分詞,並轉換為一列由詞性標記所組成的序列;然後使用馬爾可夫模型,利用人工標注數據庫訓練詞語連接處詞性標注序列的概率分佈和連接類型序列的距離信息,得到輸入的詞性標記序列對應的具有最大似然概率的連接類型序列,最後利用后處理規則進行適當的糾錯。The categories of preprocessing token are : header names, identifiers, preprocessing numbers, character literals, string literals, preprocessing - op - or - punc, and single non - white - space characters that do not lexically match the other preprocessing token categories
預處理標記有以下幾類:頭名稱,標識符,預處理數字,字元文字量,字元串文字量,預處理運算符或標點,以及不符合以上詞法類型的單個非空白字元。Pos tagset selection in the past was mainly done by experts using human knowledge manually, since there is no automatic or semi - automatic way to assist the selection process
摘要過去詞類標記集的選擇主要基於專家的經驗知織,缺乏自動或半自動的方法來輔助這一過程。With difference from the way that traditional methods perform by accumulating the frequency of keywords. we propose a new metrical function that employs the rs - based entropy by comparing function values to measure the feature of web pages. besides, according to the unstructured and heterogenous characteristics of www, the effect of hypertext tags to keywords " weigh has been taken into account to obtain the most effective keywords for document classification
有別于傳統的對關鍵字頻度進行累加的方法,本文提出了基於信息熵的文本關鍵詞測度函數,通過對關鍵詞函數值進行比較,獲取對文本分類最具影響性的關鍵詞序列;同時,針對web上異質、非結構化信息的特點,該分類演算法還考慮了超文本標記對關鍵詞權值的影響。After one minute or two, the page is translated. then you can click words to mark them
在翻譯后的頁面中點擊單詞,可以將單詞標記為各種類型。Second, the term extraction transformation tags words as one of the following parts of speech
第二,字詞提取轉換將單詞標記為下列詞類之一:Then user browses the web pages, selects interested web page as sample page and create conceptual schema based on the understanding of content of the sample page. thirdly, the user marks the interested information blocks in sample page and build up the correspondence between the interested information blocks of sample page and fields of conceptual schema, and at the meanwhile system passes the correspondence to learning module to form extraction rules and stores them into rule database. fourthly, extraction module extracts information from the similar - structured pages by using the extraction ailes and sends the extracted results to classified cache database for users " further query
該系統首先將web信息按模式和風格劃分為不同領域,建立領域知識庫,以便對用戶的查詢進行導航並縮小查詢范圍;然後利用現有的搜索引擎實現關鍵詞查詢,利用瀏覽功能尋找感興趣的網頁,對找到的網頁根據其內容建立自定義的用戶概念模式;利用概念模式對網頁進行標記,使概念模式中的欄位與網頁中的信息塊對應起來,建立對應關系並傳給學習模塊以形成抽取規則並存入規則庫;抽取模塊根據規則庫中的抽取規則進行信息抽取並將抽取到的信息按用戶定義的概念模式形成記錄,按領域分類存儲于cache庫中以備用戶查詢;最後,用戶利用查詢功能實現個性化查詢。Urine absorbing aids - vocabulary - identification of product types
吸尿輔助器.詞匯.產品類型標記Essentially a dtd describes the vocabulary s structure the tags and attributes, similar to the way a database schema describes a database s structure the tables and columns
簡言之, dtd描述了詞匯表的結構(標記和屬性) ,類似於數據庫模式描述數據庫的結構(表和列) 。分享友人