Abstract
Against the background of the global economic crisis, it is important to understand how enterprises relate to the wider economy, to analyze their characteristics, and to form a clear picture of each enterprise's current situation. Doing so makes it possible to judge whether an enterprise is at risk of being written off and to focus attention on enterprises that show the characteristics of written-off enterprises.
This article obtains enterprise data from the database of the Zhejiang Administration for Industry and Commerce and uses ETL to extract, transform, and load it, yielding records of Zhejiang enterprises from 2005 to 2007. To improve the validity and authenticity of the data, the records are filtered with SPSS Clementine 10.0. Drawing on data mining theory, the article analyzes and compares the C5.0 and C&R Tree algorithms and then builds a feature analysis model of written-off enterprises using the C5.0 algorithm. Interpreting the model shows that whether a manufacturing enterprise is written off depends on nine factors, including the operating life cycle and the registration body. The model reaches an accuracy rate of 95.81%, with a reported error rate of 4.29%. Analysis of the misclassified records indicates that the main cause of error is missing raw data, which prevents some influencing factors from being reflected in the model, so the model does not reach its best possible performance. Its overall evaluation is nevertheless good, and it may provide decision support to users.
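To illustrate what a comparison between C5.0 and C&R Tree involves, the following is a minimal sketch of the split criteria usually associated with the two algorithms. It assumes, as is commonly described, that C5.0 ranks candidate splits with an entropy-based measure (information gain or gain ratio, as in its predecessor C4.5) and that C&R Tree uses Gini impurity for categorical targets. The field names `short_lived` and `written_off` and the four records are hypothetical and are not taken from the Zhejiang data. Neither function reproduces the SPSS Clementine implementation.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())


def split_scores(rows, attr, target):
    """Score a split of `rows` on the categorical field `attr`.

    Returns (gain_ratio, gini_reduction):
      gain_ratio      entropy-based score in the style of C4.5/C5.0
      gini_reduction  impurity decrease in the style of C&R Tree
    """
    labels = [r[target] for r in rows]
    groups = {}
    for r in rows:
        groups.setdefault(r[attr], []).append(r[target])

    n = len(rows)
    weights = [len(g) / n for g in groups.values()]
    branches = list(groups.values())

    info_gain = entropy(labels) - sum(
        w * entropy(g) for w, g in zip(weights, branches)
    )
    split_info = -sum(w * log2(w) for w in weights)
    gain_ratio = info_gain / split_info if split_info > 0 else 0.0

    gini_reduction = gini(labels) - sum(
        w * gini(g) for w, g in zip(weights, branches)
    )
    return gain_ratio, gini_reduction


# Hypothetical toy records (not from the source data).
toy_rows = [
    {"short_lived": True, "written_off": True},
    {"short_lived": True, "written_off": True},
    {"short_lived": False, "written_off": False},
    {"short_lived": False, "written_off": False},
]

gain_ratio, gini_reduction = split_scores(toy_rows, "short_lived", "written_off")
print(gain_ratio, gini_reduction)
```

Both scores are highest for a field that separates written-off from surviving enterprises cleanly. On real data the two criteria can rank candidate fields differently, which is one reason the resulting trees may differ.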
By analyzing the characteristics of enterprises, decision-makers can understand the current situation more clearly and thus obtain accurate information that benefits all aspects of development.
Keywords: Zhejiang industry and commerce, written-off enterprise, data mining, C5.0 algorithm, C&R Tree