Exploratory Data Analysis and Data Mining

I have provided for your analysis an Excel spreadsheet that contains 603 rows (records) and
41 columns (fields).  The information stems from a level 5 organization software development
problem reporting system.  Each of the fields is explained in a Microsoft Word documunt or an HTML
document.

Excel spreadsheet
Document containing the explanation and the various contents of the fields
A HTML document containg the same explanation.

For this analysis you should use exploratory data analysis and data mining techniques.  Formulate important questions that you would like to know the answer to from this rich set of information.  For example, what fault classifications are most common in the different phases of the software enginneering problems.  (For example, a subquestion: "Are logic faults most prominant in prelimary design?" Then an extension to this question, if logic faults are most common, do they require the most effort?)

Use plenty of charts and graphs to visually display your outcomes.

Due date: Thursday, December 7, 2000.