A system that facilitates data processing includes a receiver component
that receives an SQL query. A partitioning component partitions the SQL
query into multiple tasks and provides the tasks to multiple cluster
nodes for processing. The system enables very large amounts of data
(e.g., multiple terabytes) to be quickly prepared for analytical
processing, such as for use in connection with a search engine, an
advertisement provision system, etc.
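
The receive, partition, and distribute flow described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the described system: the names `Task`, `partition_query`, `run_on_node`, and `execute`, the `events` table, the use of a thread pool to stand in for cluster nodes, and the choice to partition by sharding the input rows. The merge step is only valid for aggregates that can be combined from partial results, such as SUM.

```python
import sqlite3
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass(frozen=True)
class Task:
    """Hypothetical unit of work: one query applied to one shard of the data."""
    node_id: int
    sql: str
    rows: tuple


def partition_query(sql, rows, num_nodes):
    """Split the work into one task per node by sharding rows round-robin."""
    return [
        Task(node_id=i, sql=sql, rows=tuple(rows[i::num_nodes]))
        for i in range(num_nodes)
    ]


def run_on_node(task):
    """Stand-in for a cluster node: load the shard into SQLite and run the SQL."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE events (ad_id TEXT, clicks INTEGER)")
        conn.executemany("INSERT INTO events VALUES (?, ?)", task.rows)
        return conn.execute(task.sql).fetchall()
    finally:
        conn.close()


def execute(sql, rows, num_nodes=3):
    """Receive a query, partition it into tasks, run them in parallel, merge."""
    tasks = partition_query(sql, rows, num_nodes)
    with ThreadPoolExecutor(max_workers=num_nodes) as pool:
        partials = list(pool.map(run_on_node, tasks))
    # Merge per-node partial sums; correct only for decomposable aggregates.
    totals = {}
    for partial in partials:
        for ad_id, subtotal in partial:
            totals[ad_id] = totals.get(ad_id, 0) + subtotal
    return totals


if __name__ == "__main__":
    sample_rows = [("a", 1), ("b", 2), ("a", 3), ("c", 4), ("b", 5)]
    query = "SELECT ad_id, SUM(clicks) FROM events GROUP BY ad_id"
    print(execute(query, sample_rows))
```

In this sketch each task carries the same SQL text and a different slice of the data, so the per-node results are partial sums that the caller adds together. A real deployment would send tasks to separate machines and would need a different merge strategy for non-decomposable operations such as medians or distinct counts.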