A method is provided for selecting a representative set of training data for training a statistical model in a machine condition monitoring system. The method reduces the time required to choose representative samples from a large data set by using a nearest-neighbor sequential clustering technique in combination with a kd-tree. A distance threshold is used to limit the geometric size of the clusters. Each node of the kd-tree is assigned a representative sample from the training data, and similar samples are subsequently discarded.
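The idea in the abstract can be illustrated with a minimal sketch. It is not the patented implementation; it only assumes that "similar" means "within a Euclidean distance threshold of an already chosen representative". The names `KDNode`, `kd_insert`, `kd_nearest`, and `select_representatives` are hypothetical and chosen for illustration. Each sample is compared with its nearest representative through a kd-tree lookup. It becomes a new representative (and a new tree node) only if it lies farther away than the threshold; otherwise it is discarded.

```python
import math


class KDNode:
    """One kd-tree node holding a single representative sample."""

    def __init__(self, point, axis):
        self.point = point
        self.axis = axis      # coordinate index used to split at this node
        self.left = None
        self.right = None


def kd_insert(root, point):
    """Insert point into the tree and return the root (unbalanced insert)."""
    if root is None:
        return KDNode(point, 0)
    node = root
    while True:
        branch = "left" if point[node.axis] < node.point[node.axis] else "right"
        child = getattr(node, branch)
        if child is None:
            next_axis = (node.axis + 1) % len(point)
            setattr(node, branch, KDNode(point, next_axis))
            return root
        node = child


def kd_nearest(node, query, best=None):
    """Return (distance, point) for the stored point closest to query."""
    if node is None:
        return best
    d = math.dist(query, node.point)
    if best is None or d < best[0]:
        best = (d, node.point)
    diff = query[node.axis] - node.point[node.axis]
    near, far = (node.left, node.right) if diff < 0 else (node.right, node.left)
    best = kd_nearest(near, query, best)
    # Search the far side only if the splitting plane is closer than the best hit.
    if abs(diff) < best[0]:
        best = kd_nearest(far, query, best)
    return best


def select_representatives(samples, threshold):
    """Sequentially keep a sample only if no representative is within threshold."""
    root = None
    representatives = []
    for sample in samples:
        sample = tuple(sample)
        nearest = kd_nearest(root, sample)
        if nearest is None or nearest[0] > threshold:
            root = kd_insert(root, sample)
            representatives.append(sample)
        # Otherwise the sample is similar to an existing representative
        # and is discarded.
    return representatives


samples = [(0, 0), (0.1, 0), (5, 5), (5.1, 5.05), (0, 0.2), (10, 0)]
representatives = select_representatives(samples, threshold=1.0)
```

In this sketch the threshold bounds how far any discarded sample can lie from its representative, which is one way to read "limit the geometric size of the clusters". The tree is built by plain insertion and is not rebalanced, so lookup cost depends on the order in which samples arrive.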

 