Abstract: This paper takes the characteristics of customers who buy insurance as its research object and uses correlation analysis and logistic regression to predict the renewal rate of policyholders. First, the data are preprocessed, and the usable variables are converted into dummy variables for correlation analysis. Two types of data, discrete and continuous, are converted in this way. For discrete data, discrete intervals or attributes can be directly transformed into ...