Machine Learning - Python - SKLearn
 Machine Learning: Lung Cancer Dataset

The group aimed to find the various correlations between risk factors, lifestyle factors and symptoms of and between Lung Cancer Severity. Each group member observed a different aspect of the dataset so that we gained an overall view of what features affect lung cancer severity the most, or if the severity indicated the magnitude of one of its features. ML was done on python using SKLearn. 
The question I asked was "Can we predict whether someone with lung cancer over the age of 30 smoked regularly or not?". After filtering the data I used the decision trees ML method and managed to predict with 98.1% accuracy whether a person in the data set was a regular smoker. 

You may also like

Back to Top