Abstract:
An apparatus and method for analyzing bottlenecks in a data distributed processing system are provided. The apparatus includes a learning unit that mines and learns bottleneck-feature association rules based on hardware information related to a bottleneck node, job configuration information related to a bottleneck-causing job, and/or I/O information regarding a bottleneck-causing task. Using the bottleneck-feature association rules, a bottleneck cause analyzing unit detects a bottleneck node among multiple nodes performing tasks in the data distributed processing system and analyzes the cause of the bottleneck.
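
As a rough illustration of what mining bottleneck-feature association rules could involve, the sketch below counts how often small feature combinations co-occur with a bottleneck and keeps those that meet support and confidence thresholds. It is a simplified assumption, not the claimed method: the function names (`mine_rules`, `analyze`), the feature strings, the thresholds, and the sample records are all hypothetical.

```python
from collections import Counter
from itertools import combinations


def mine_rules(records, min_support=0.5, min_confidence=0.8):
    """Mine rules of the form {features} -> bottleneck.

    records: list of (feature_set, is_bottleneck) pairs (hypothetical format).
    Returns a list of (antecedent, support, confidence) tuples.
    """
    n = len(records)
    antecedent_counts = Counter()  # records containing the feature combination
    joint_counts = Counter()       # ...that were also bottlenecks
    for features, is_bottleneck in records:
        items = sorted(features)
        for size in (1, 2):  # only combinations of one or two features
            for combo in combinations(items, size):
                key = frozenset(combo)
                antecedent_counts[key] += 1
                if is_bottleneck:
                    joint_counts[key] += 1

    rules = []
    for key, joint in joint_counts.items():
        support = joint / n                       # P(features and bottleneck)
        confidence = joint / antecedent_counts[key]  # P(bottleneck | features)
        if support >= min_support and confidence >= min_confidence:
            rules.append((key, support, confidence))
    # Highest confidence first, then highest support, then by feature names.
    rules.sort(key=lambda r: (-r[2], -r[1], sorted(r[0])))
    return rules


def analyze(node_features, rules):
    """Return the rule antecedents that a node's features satisfy."""
    return [antecedent for antecedent, _, _ in rules if antecedent <= node_features]


# Hypothetical records mixing hardware, job configuration, and I/O features.
records = [
    ({"disk=hdd", "reducers=1", "io=high"}, True),
    ({"disk=hdd", "reducers=1", "io=low"}, True),
    ({"disk=ssd", "reducers=8", "io=low"}, False),
    ({"disk=ssd", "reducers=1", "io=high"}, False),
]
rules = mine_rules(records)
causes = analyze({"disk=hdd", "reducers=1", "io=high"}, rules)
```

Here a node is treated as a likely bottleneck when its features satisfy at least one learned rule, and the matching antecedents serve as candidate causes. A production system would likely use a full frequent-itemset algorithm such as Apriori or FP-Growth rather than this fixed-size enumeration.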