Journal of Zhejiang University SCIENCE A

ISSN 1673-565X(Print), 1862-1775(Online), Monthly

An improved TF-IDF approach for text classification

Abstract: This paper presents an improved term frequency/inverse document frequency (TF-IDF) approach that uses confidence, support, and characteristic words to enhance the recall and precision of text classification. The improved approach also processes synonyms defined by a lexicon. We discuss and analyze in detail the relationship among confidence, recall, and precision. Experiments in the science and technology domain gave promising results: the new TF-IDF approach improves the precision and recall of text classification compared with the conventional TF-IDF approach.

Key words: Term frequency/inverse document frequency (TF-IDF), Text classification, Confidence, Support, Characteristic words
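
The abstract does not give formulas, so the following is only a minimal sketch of the two ingredients it names, not the paper's method. It assumes the conventional weighting tf(t, d) * log(N / df(t)) as the baseline and the standard association-rule definitions of support and confidence for a rule "term -> category"; the paper's own definitions may differ. The function names, the toy documents, and the labels are all hypothetical.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Conventional TF-IDF for tokenized documents: tf(t, d) * log(N / df(t))."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


def support_confidence(docs, labels, term, category):
    """Support and confidence of the rule `term -> category` (assumed definitions).

    support    = fraction of all documents that contain the term and have the category
    confidence = fraction of documents containing the term that have the category
    """
    with_term = [label for doc, label in zip(docs, labels) if term in doc]
    both = sum(1 for label in with_term if label == category)
    support = both / len(docs)
    confidence = both / len(with_term) if with_term else 0.0
    return support, confidence


# Hypothetical toy corpus, for illustration only.
docs = [
    ["neural", "network", "training"],
    ["network", "protocol", "routing"],
    ["neural", "model", "training"],
    ["routing", "table", "protocol"],
]
labels = ["ai", "networking", "ai", "networking"]

weights = tf_idf(docs)
support, confidence = support_confidence(docs, labels, "neural", "ai")
```

In this toy corpus, "neural" appears only in the two documents labeled "ai", so the rule has high confidence. A term with that property is the kind one might treat as a characteristic word for the category, although how the paper combines these quantities with TF-IDF is not stated in the abstract.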



DOI: 10.1631/jzus.2005.A0049
CLC number: TP31

Received: 2003-12-05
Revision Accepted: 2004-06-26

Journal of Zhejiang University-SCIENCE, 38 Zheda Road, Hangzhou 310027, China
Tel: +86-571-87952276; Fax: +86-571-87952331; E-mail: jzus@zju.edu.cn
Copyright © 2000~ Journal of Zhejiang University-SCIENCE