HomeMachine LearningPCA – Explained

PCA – Explained

This notebook loads the breast cancer dataset and inspects its shape, duplicates, missing values, and class balance.

It separates features and target, then splits data into training and test sets.

The features are standardized before PCA is applied. PCA is used to compute explained variance and reduce the dataset from 30 to 10 principal components.

A Random Forest is trained on the reduced PCA data and evaluated.

Another Random Forest is trained on the original features for comparison.

Finally, it compares accuracy and discusses how PCA reduces dimensionality while preserving most variance.

PCA_1_explained_with_example_breast_cancer

Share: 

No comments yet! You be the first to comment.

Leave a Comment

Your email address will not be published. Required fields are marked *