Sources of operational problems in business transactions often show
themselves in relatively small pockets of data, which are called trouble
hot spots. Identifying these hot spots from internal company transaction
data is generally a fundamental step in the problem's resolution, but
this analysis process is greatly complicated by huge numbers of
transactions and large numbers of transaction variables to analyze. A
suite of practical modifications are provided to data mining techniques
and logistic regressions to tailor them for finding trouble hot spots.
This approach thus allows the use of efficient automated data mining
tools to quickly screen large numbers of candidate variables for their
ability to characterize hot spots. One application is the screening of
variables which distinguish a suspected hot spot from a reference set.