Python Libraries
to (Hugely) Boost
Your Data Science
Productivity
Avi Chawla
avichawla.substack.com
1. YellowBrick
A suite of visualization
and diagnostic tools built on Matplotlib and Sklearn.
2. PyCaret
Automate ML workflows
with this low-code
library.
3. imbalanced-learn
A variety of methods
to handle class
imbalance.
4. Modin
5. SHAP
6. Missingno
7. Prophet
Produce high-quality
forecasts on
time-series
data.
8. Parallel-Pandas
9. Featuretools
Automated feature
engineering for
ML models.
11. mlxtend
12. Vaex
13. SweetViz
14. Skorch
Leverage the power of
PyTorch with the
elegance of sklearn.
15. Faiss
16. statsmodels
17. Pandas-Profiling
Generate a high-level
EDA report of your
data in no time.
18. Streamlit
19. Category-encoders
Over 15 categorical
data encoders.
20. DuckDB
21. PandasML
22. Pytest
An elegant testing
framework to test
your code.
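A minimal sketch of what a Pytest test file can look like. The `slugify` helper is hypothetical and exists only for illustration; Pytest collects functions whose names start with `test_` and reports any failing `assert`.

```python
def slugify(title: str) -> str:
    """Hypothetical helper: lowercase a title and join its words with hyphens."""
    return "-".join(title.lower().split())


# Pytest discovers functions named test_* and runs each one as a test case.
def test_slugify_joins_words():
    assert slugify("Data Science Productivity") == "data-science-productivity"


def test_slugify_collapses_whitespace():
    # str.split() with no arguments drops leading, trailing, and repeated spaces.
    assert slugify("  Python   Lib ") == "python-lib"
```

Saved as a file such as `test_slugify.py` (an assumed name), this would typically be run with the `pytest` command from the same directory.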
23. Numexpr
Parallelize NumPy
expressions across all
CPU cores for up to
20x speedup.
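A small sketch of the idea, assuming `numexpr` is installed (the snippet falls back to plain NumPy if it is not). The array sizes and the expression are arbitrary choices for illustration, and the actual speedup depends on the expression, the data size, and the machine.

```python
import numpy as np

try:
    import numexpr as ne  # optional dependency
except ImportError:
    ne = None

a = np.arange(1_000_000, dtype=np.float64)
b = np.ones_like(a)

# Plain NumPy evaluates the expression step by step, creating temporary arrays.
expected = 2 * a + 3 * b

if ne is not None:
    # numexpr compiles the string expression and evaluates it in chunks,
    # which it can spread across multiple threads.
    result = ne.evaluate("2 * a + 3 * b", local_dict={"a": a, "b": b})
else:
    result = expected.copy()

print(np.allclose(result, expected))
```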
24. CSV-Kit
25. PivotTableJS
Drag-n-drop tools to
group, pivot, and plot
a dataframe.
26. Faker
27. Icecream
28. Pyforest
29. PySnooper
30. Sidetable
Supercharge Pandas'
value_counts()
method.
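To illustrate what a richer `value_counts()` summary looks like, here is a pandas-only sketch that builds counts, percentages, and running totals by hand. It is not Sidetable's implementation, and the `fruit` column is made-up sample data.

```python
import pandas as pd

df = pd.DataFrame({"fruit": ["apple", "pear", "apple", "fig", "apple", "pear"]})

# Plain pandas: value_counts() returns only the counts, sorted descending.
counts = df["fruit"].value_counts()

# Build the richer summary by hand: counts, percentages, and running totals.
summary = pd.DataFrame({
    "count": counts,
    "percent": counts / counts.sum() * 100,
})
summary["cumulative_count"] = summary["count"].cumsum()
summary["cumulative_percent"] = summary["percent"].cumsum()

print(summary)
```

With Sidetable installed, `import sidetable` followed by `df.stb.freq(["fruit"])` is intended to give a comparable table in a single call.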
Hope that
helped.
https://www.linkedin.com/in/avi-chawla