An efficient document clustering algorithm and its. Review on Raw Politics

MouthShut Score

50%

3 Votes

khusboo

mumbai India

An efficient document clustering algorithm and its

Apr 14, 2009 01:12 PM 2848 Views

We present an efficient document clustering algorithm that uses a term frequency vector for each document instead of using a huge proximity matrix.
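To make the memory argument concrete, here is a minimal sketch of what a term frequency vector per document might look like, next to the number of entries a full pairwise proximity matrix would need. The whitespace tokenizer, the toy English sentences, and the helper names `term_frequency_vector` and `pairwise_entries` are assumptions for illustration only; Japanese text would need a proper word segmenter, and none of this is taken from the authors' implementation.

```python
from collections import Counter


def term_frequency_vector(text):
    """Count how often each whitespace-separated term occurs in one document."""
    return Counter(text.lower().split())


def pairwise_entries(n_docs):
    """Entries in a symmetric document-by-document proximity matrix (no diagonal)."""
    return n_docs * (n_docs - 1) // 2


# Toy documents (made up for this sketch).
docs = [
    "election results announced today",
    "election campaign results",
    "football match results today",
]

vectors = [term_frequency_vector(d) for d in docs]

# Storage for the vectors grows with the distinct terms in each document.
tf_entries = sum(len(v) for v in vectors)

# Storage for a proximity matrix grows with the square of the collection size.
print("term frequency entries:", tf_entries)
print("proximity entries for this toy set:", pairwise_entries(len(docs)))
print("proximity entries for 83,099 documents:", pairwise_entries(83099))
```

On a tiny collection the matrix is smaller, but its size grows quadratically with the number of documents, while the vectors grow only linearly.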

The algorithm has the following features:

(1) It requires a relatively small amount of memory and runs fast.

(2) It produces a hierarchy in the form of a document classification tree.

(3) The hierarchy obtained by the algorithm explicitly reveals the structure of the collection.

We confirm these features, and thus show the algorithm's feasibility, through clustering experiments on two collections of Japanese documents containing 83,099 and 14,701 documents. We also introduce an application of this algorithm to a document browser.
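The review does not describe how the algorithm actually builds its tree, so the following is only a rough sketch of one way a document classification tree can be produced from term frequency vectors without a full proximity matrix. It uses a simple recursive two-way split around two seed documents with cosine similarity; that technique, the names `cosine` and `build_tree`, the `leaf_size` parameter, and the toy documents are all assumptions, not the authors' method.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two term frequency vectors (Counters)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_tree(doc_ids, vectors, leaf_size=2):
    """Recursively split documents in two, comparing each one only to two seeds."""
    node = {"docs": doc_ids, "children": []}
    if len(doc_ids) <= leaf_size:
        return node
    seed_a = doc_ids[0]
    # Second seed: the document least similar to the first seed.
    seed_b = min(doc_ids[1:], key=lambda d: cosine(vectors[seed_a], vectors[d]))
    left, right = [], []
    for d in doc_ids:
        sim_a = cosine(vectors[d], vectors[seed_a])
        sim_b = cosine(vectors[d], vectors[seed_b])
        (left if sim_a >= sim_b else right).append(d)
    if not left or not right:
        return node  # The documents could not be separated; keep them as a leaf.
    node["children"] = [
        build_tree(left, vectors, leaf_size),
        build_tree(right, vectors, leaf_size),
    ]
    return node


# Toy documents (made up for this sketch).
texts = [
    "election results vote",
    "election vote campaign",
    "football match goal",
    "football goal league",
]
vectors = [Counter(t.split()) for t in texts]
tree = build_tree(list(range(len(texts))), vectors)
print(tree)
```

Each split compares every document with only two seeds, so no document-by-document matrix is ever stored.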

This browser is used in our Japanese-to-English translation aid system. The browsing module of the system consists of a huge database of Japanese news articles and their English translations. The Japanese article collection is clustered into a hierarchy by our method. Since each node in the hierarchy corresponds to a topic in the collection, we can use the hierarchy to access articles directly by topic. A user can learn general translation knowledge for each topic by browsing the Japanese articles and their English translations. We also discuss techniques for presenting a large tree-shaped hierarchy on a computer screen.
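As an illustration of accessing articles by topic, a topic hierarchy of this kind could be modelled roughly as below. The `TopicNode` class, the `find_node` and `collect_articles` helpers, the topic labels, and the placeholder article strings are all hypothetical; the actual browser's data model is not described in the review.

```python
from dataclasses import dataclass, field


@dataclass
class TopicNode:
    label: str
    # Each article is a (japanese_text, english_translation) pair.
    articles: list = field(default_factory=list)
    children: list = field(default_factory=list)


def find_node(root, path):
    """Walk down the tree following a list of topic labels."""
    node = root
    for label in path:
        matches = [child for child in node.children if child.label == label]
        if not matches:
            raise KeyError(label)
        node = matches[0]
    return node


def collect_articles(node):
    """Gather the article pairs stored at a node and everything beneath it."""
    found = list(node.articles)
    for child in node.children:
        found.extend(collect_articles(child))
    return found


# Placeholder data (made up for this sketch).
root = TopicNode("all", children=[
    TopicNode("politics", articles=[("JA article 1", "EN translation 1")], children=[
        TopicNode("elections", articles=[("JA article 2", "EN translation 2")]),
    ]),
    TopicNode("economy", articles=[("JA article 3", "EN translation 3")]),
])

for japanese, english in collect_articles(find_node(root, ["politics"])):
    print(japanese, "|", english)
```

Selecting a node returns every article pair filed under that topic and its subtopics, which is the kind of side-by-side browsing the review describes.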
