1. [ホーム]
  2. [研究業績]
  3. [研究業績詳細]

研究業績詳細

タイトル 学習データが少量しかない場合の文書分類に関する一考察
著者 前田康成 、吉田秀樹 、鈴木正清 、松嶋敏泰
年度 2011
形式 論文誌
分野 知識情報処理
掲載雑誌名 電気学会論文誌
掲載号・ページ vol.131, no.8, pp.1459-1466
掲載年 2011
掲載月 8
アブスト
(日本語)
査読:有
DOI:10.1541/ieejeiss.131.1459
アブスト
(英語)
Document classification is one of important topics in the field of NLP(Natural Language Processing). In the previous research a document classification method has been proposed which minimizes an error rate with reference to a Bayes criterion. But when the number of documents in training data is small, the accuracy of the previous method is low. So in this research we propose a new document classification method using estimating data in order to estimate prior distributions, which is based on the previous method. When the training data is small the accuracy of the proposed method is higher than the accuracy of the previous method. But when the training data is big the accuracy of the proposed method is lower than the accuracy of the previous method. So in this research we also propose another document classification method whose accuracy is higher than the accuracy of the previous method when the training data is small, and is almost the same as the accuracy of the previous method when the training data is big.
備考
(日本語)
3
備考
(英語)
3
論文原稿
発表資料

関連論文