A method and system is provided for integrating multiple feature spaces in a k-means clustering algorithm when analyzing data records having multiple, heterogeneous feature spaces. The method assigns different relative weights to these various features spaces. Optimal feature weights are also determined that lead to a clustering that simultaneously minimizes the average intra-cluster dispersion and maximizes the average inter-cluster dispersion along all the feature spaces. Examples are provided that empirically demonstrate the effectiveness of feature weighting in clustering using two different feature domains.

 
Web www.patentalert.com

< System and method for reliable multicast data distribution over an unreliable packet switched network

< Enhanced backup and recovery methodology

> Method and apparatus for increasing virus detection speed using a database

> Table format data presenting method, inserting method, deleting method, and updating method

~ 00212