A method for organizing deep Web services is provided. In one aspect, the
method obtains a collection of sources and their associated attributes
and/or input modes, for instance, using a crawling algorithm. The method
uses this information to organize the sources into communities. A mining
algorithm such as the hyperclique mining algorithm is used to obtain
cliques of highly correlated attributes. A clustering algorithm such as
the hierarchical agglomerative clustering algorithm is used to further
cluster the cliques of attributes into larger cliques, which in the
present disclosure is referred to as signatures. The sources that are
associated with each signature form a community and a graph
representation of the communities is constructed, where the vertices are
communities and the edges are the shared attributes.