Klasifikasi dokumén
Pidangan
Klasifikasi dokumén nyaéta hiji masalah dina élmu informasi. Tugasna taya lian pikeun nempatkeun hiji dokumén kana hiji atawa sababaraha kategori, dumasar eusina. Pancén klasifikasi dokumén bisa dibagi kana dua cara: supervised document classification where some external mechanism (such as human feedback) provides information on the correct classification for documents, and unsupervised document classification, where the classification must be done entirely without reference to external information.
Téhnik klasifikasi dokumén ngawengku:
and approaches based on natural language processing.
A recent notable use of document classification techniques has been spam filtering which tries to discern E-mail spam messages from legitimate emails.
Tempo ogé
[édit | édit sumber]Rujukan
[édit | édit sumber]- Wikipédia basa Inggris, disalin ping 31 Désémber 2004.
Tumbu kaluar
[édit | édit sumber]- Rafael A. Calvo, Jae-Moon Lee and Xiaobo Li. Managing Content with Automatic Document Classification Archived 2004-12-31 di Wayback Machine. Journal of Digital Information, Volume 5 Issue 2, Article No. 282, 2004-06-08
- Introduction to document classification Archived 2007-06-13 di Wayback Machine
Artikel ieu mangrupa taratas, perlu disampurnakeun. Upami sadérék uninga langkung paos perkawis ieu, dihaturan kanggo ngalengkepan. |