Welcome to Pywedge documentation
What is Pywedge?
Pywedge is a pip installable Python package that intends to,
-
Make multiple interactive charts in a single line of code, to enable the user to quickly read through the charts and can make informed choices in pre-processing steps
-
Interactively preprocess the data by taking the user’s preferred choice of pre-processing techniques,
-
Make a baseline model summary, which can return ten various baseline models & predict the standout test data from selected baseline model.
-
Interactively select hyperparameters in a widget style tab, track the hyperparameters using MLFlow & predict on standout data.
Pywedge intends to help the user by quickly making charts, preprocessing the data and to rightly point out the best performing baseline model for the given dataset so that the user can spend quality time tuning such a model algorithm.
Installation
pip install pywedge
For JupyterLab, please run the following commands in anaconda prompt to enable required JupyterLab extensions to display interactive chart widget,
conda install -c conda-forge nodejs
jupyter labextension install @jupyter-widgets/jupyterlab-manager
jupyter labextension install jupyterlab-plotly@4.14.1
jupyter labextension install @jupyter-widgets/jupyterlab-manager plotlywidget@4.14.1
Mlflow is not a hard requirement in Pywedge, this is mainly to keep the pywedge light weight for the benefit of trying pywedge in web apps like Heroku. So mlflow has to be separately installed to track the hyperparameters,
pip install mlflow
Please note: Pywedge is available only in Jupyter Notebook & JupyterLab environments
Pywedge Features
-
Makes 8 different types of interactive charts with interactive axis selection widgets
-
Interactive pre-processing & 10 different baseline models
- Missing values imputation for numeric & categorical columns
- Standardization
- Feature importance
- Class oversampling using SMOTE
- Computes 10 different baseline models
-
Interactive Hyperparameter tuning & tracking hyperparameters using integrated MLFlow
-
Classification / Regression Hyperparameters tuning
- Available baseline estimators for interactive hyperparameter tuning as of now, more baseline estimators will be added soon for interactive hyperparameter tunings
Classification Regression Logistic Regression Linear Regression Decision Tree Classifier Decision Tree Regressor Random Forest Classifier Random Forest Regressor AdaBoost Classifier AdaBoost Regressor ExtraTrees Classifier ExtraTrees Regressor KNN Classifier KNN Regressor
-