Surf these sites: About Scatter/Gather -- The Scatter/Gather interface uses text clustering as a way to group document according to the overall similarities in their content. Scatter/Gather is so named because it allows the user to scatter documents into clusters, or groups, then gather a subset of these groups and re-scatter them to form new groups. About TileBars -- To many users, the way search engines choose to rank retrieved documents is a bit of a mystery. The TileBars interface is an attempt to show the user, graphically, the relationship between the words in the query and the documents retrieved. About the Cat-a-Cone -- A novel user interface that integrates search and browsing of very large category hierarchies with their associated text collections. One key insight is the separation of the representation of category labels from documents, which allows the display of multiple categories per document. Defining Data Visualization -- Don Nachtwey - Thinx Software. Data Visualization is an emerging market space that is not well defined. Data Visualization is a breed of products that feature a graphic component and a data component. But as a consumer with a data visualization requirement, I would have trouble doing a product comparison. Online Text Mining -- This site serves as a clearinghouse of information on the topic of Online Text Mining, with links to many articles and vendor sites relevant to this topic. Text Mining and the Knowledge Management Space -- A Semio Corporation white paper. Text Mining: Beyond Search Technology -- Patricia Soto. The rise of text mining is taking search engine capabilities to the next level. Here''s how to reap the advantages while avoiding the pitfalls of implementing this new technology within your organization. Untangling Text Data Mining -- Defines data mining, information access, and corpus-based computational linguistics, and then discusses the relationship of these to text data mining. The intent behind these contrasts is to draw attention to exciting new kinds of problems for computational linguists.
Help build the largest human-edited
directory on the web.