maybe u can help me discussing this topic :)
by the way.. i must implement TF-IDF with Vector Space Model and K-Mean clustering for searching similiar document. which is better? or I must implement all of these method?
maybe u can contact me via email :D
vans (dot) emperor (at) gmail (dot) com