Stars
A repository showcasing the best framework for tabular data: Awesome CatBoost
Tutorials and examples of various recommender systems in industrial applications
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
A deep dive into embeddings starting from fundamentals
Machine Learning Engineering Open Book
Website containing illustrations about Machine Learning theory!
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.
Learn how to design systems at scale and prepare for system design interviews
📝 Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)
📝 A collection of machine learning resources
A compilation of main commands for scikit-learn with examples
🐙 Guides, papers, lessons, notebooks and resources for prompt engineering, context engineering, RAG, and AI Agents.
PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡
Master the command line, in one page
Data search & enrichment library for Machine Learning → Easily find and add relevant features to your ML & AI pipeline from hundreds of public and premium external data sources, including open & co…
Answers to 120 commonly asked data science interview questions.
The simplest, fastest repository for training/finetuning medium-sized GPTs.
This repo is meant to serve as a guide for Machine Learning/AI technical interviews.
Minimal Machine Learning Study Plan
Probably the best curated list of data science software in Python.
Python library for converting Scikit-Learn pipelines to PMML
Pen and paper exercises in machine learning
Curated list of data science interview questions and answers


