sklearn
sklearn
sklearn

Sklearn Algorithm Cheat Sheet

2023-03-12

Sklearn Algorithm Cheat Sheet

This article presents a useful cheat sheet provided by Sklearn for selecting the appropriate machine learning model or algorithm based on your data type and problem.

Python
Python
sklearn
sklearn
Machine Learning
Machine Learning
Scikit-learn Pipeline for Machine Learning

2023-03-10

Scikit-learn Pipeline for Machine Learning

Scikit-learn Pipeline is a framework that streamlines the data preprocessing and model building stages of machine learning. It allows users to chain together multiple data processing and feature extraction techniques into a single pipeline, facilitating testing and experimentation while avoiding data leakage. Using Scikit-learn Pipeline saves time and resources, improves code readability, and improves the performance of machine learning models. Building a Scikit-learn Pipeline involves preprocessing data using Scikit-learn transformers, creating a Pipeline object, fitting and transforming data with the Pipeline, and tuning hyperparameters using GridSearchCV.

Python
Python
sklearn
sklearn
Converting Scikit-learn Models to ONNX and Performing Inference

2023-03-07

Converting Scikit-learn Models to ONNX and Performing Inference

This article demonstrates how to convert a Scikit-learn model into ONNX format, enabling cross-platform support and interoperability with various deep learning frameworks. We'll guide you through preparing and training a Scikit-learn model using the Iris dataset, saving the model, converting it to ONNX format, and performing inference with the ONNX model using ONNX Runtime.

Machine Learning
Machine Learning
ONNX
ONNX
sklearn
sklearn
Classification with Imbalanced Data

2023-01-20

Classification with Imbalanced Data

This article introduces effective strategies for handling imbalanced data for classification tasks in machine learning.

Machine Learning
Machine Learning
Classification
Classification
sklearn
sklearn
Pandas DataFrame Normalization

2022-12-15

Pandas DataFrame Normalization

This article explains how to conduct data normalization in Pandas DataFrame using Scikit-learn.

Python
Python
Pandas
Pandas
sklearn
sklearn
Support Vector Regression

2022-11-24

Support Vector Regression

This article explains Support Vector Regression (SVR), a powerful and versatile machine learning algorithm for predicting continuous target variables.

Machine Learning
Machine Learning
Regression
Regression
Python
Python
sklearn
sklearn
Polynomial Regression

2022-11-23

Polynomial Regression

This article covers Polynomial Regression, an extension of Linear Regression that models complex nonlinear relationships between variables.

Machine Learning
Machine Learning
Regression
Regression
Python
Python
sklearn
sklearn
K-Nearest Neighbors (KNN) Regression

2022-11-22

K-Nearest Neighbors (KNN) Regression

This article covers KNN Regression, a non-parametric supervised learning algorithm for regression tasks.

Machine Learning
Machine Learning
Regression
Regression
Python
Python
sklearn
sklearn
Ridge Regression

2022-11-22

Ridge Regression

This article explains Ridge Regression, a regularization technique used in Linear Regression models to address the issue of multicollinearity. It describes the mathematical foundation of Ridge Regression, including the cost function and L2 penalty term.

Machine Learning
Machine Learning
Regression
Regression
Python
Python
sklearn
sklearn
Lasso Regression

2022-11-21

Lasso Regression

This article covers the fundamentals of Lasso Regression, including its need for regularization and mathematical foundations.

Machine Learning
Machine Learning
Regression
Regression
Python
Python
sklearn
sklearn
Linear Regression

2022-11-20

Linear Regression

This article covers the basics of linear regression, including its definition, assumptions, and types.

Machine Learning
Machine Learning
Regression
Regression
Python
Python
sklearn
sklearn
Support Vector Machine (SVM)

2022-10-20

Support Vector Machine (SVM)

This article covers the Support Vector Machine (SVM) algorithm, including its basic concepts and terminology, the mathematics behind it, and its implementation with the Iris dataset.

Machine Learning
Machine Learning
Classification
Classification
Python
Python
sklearn
sklearn
Hierarchical Clustering

2022-10-02

Hierarchical Clustering

This article covers the basics of Hierarchical Clustering, a family of unsupervised machine learning algorithms that build a hierarchy of clusters. It includes an overview of agglomerative and divisive approaches, as well as their respective bisection and linkage methods.

Machine Learning
Machine Learning
Clustering
Clustering
Python
Python
sklearn
sklearn
K-Means Clustering

2022-10-02

K-Means Clustering

This article discusses K-Means Clustering, a popular unsupervised machine learning technique. It covers the K-Means Algorithm's objective function and steps, choosing the right number of clusters (K) using the Elbow Method, Silhouette Method, and Gap Statistic, and implementing K-Means in Python with the Iris dataset.

Machine Learning
Machine Learning
Clustering
Clustering
Python
Python
sklearn
sklearn
Feature importance in Decision Tree

2022-08-04

Feature importance in Decision Tree

This article explores the concept of feature importance in decision trees and its various methods such as Gini impurity, information gain, and gain ratio. It discusses how these methods aid in selecting the most significant variables from a dataset and simplifying complex data. The article also demonstrates how to visualize feature importance in both regression and classification cases using Python.

Machine Learning
Machine Learning
Decision Tree
Decision Tree
sklearn
sklearn
Python
Python
Gradient Boosting Decision Trees (GBDT)

2022-08-03

Gradient Boosting Decision Trees (GBDT)

This article demystifies Gradient Boosting Decision Trees (GBDT), a powerful ensemble learning method, by diving into its algorithm, comparing it to Random Forests, and providing Python implementation examples.

Machine Learning
Machine Learning
Decision Tree
Decision Tree
sklearn
sklearn
Python
Python
Random Forests with the Titanic Dataset

2022-08-02

Random Forests with the Titanic Dataset

This article guides you through implementing a random forest classifier on the Titanic dataset. Discover how to prepare the dataset, build the model using scikit-learn, and evaluate its performance. Additionally, learn to visualize feature importance to identify significant predictors of survival.

Machine Learning
Machine Learning
Decision Tree
Decision Tree
sklearn
sklearn
Python
Python
What is Decision Tree

2022-08-01

What is Decision Tree

This article explains about decision trees, a predictive modeling tool for classification and regression problems. Uncover the process of building decision trees, including recursive binary splitting, impurity measures, and pruning techniques.

Machine Learning
Machine Learning
Decision Tree
Decision Tree
sklearn
sklearn
Permutation Importance

2022-07-02

Permutation Importance

This article covers the concept of Permutation Importance and its methodology for calculating feature importance in machine learning models.

Machine Learning
Machine Learning
Python
Python
sklearn
sklearn