Purpose

This place where I store whole things I know, learn and inspect about AI, ML and DA to build up these into consistent collections

General Knowledge

Landscape

Organization

Page

  • Feature Stores for ML: Collection pages about Feature stores for ML
  • Hugging Face : The Opensource AI community
  • Kaggle: Your Machine Learning and Data Science Community
  • LF AI & DATA Projects: Linux Foundation project about Open Source Innovation in Artificial Intelligence and Data
  • Made With ML: Learning how to responsibly deliver value with ML!
  • NGC Catalog: NGC Catalog - GPU Accelerated AI models and SDKs that help you infuse AI into your applications at speed of light (NVIDIA)
  • Papers With Code: The latest in Machine Learning
  • TensorFlow Hub: A repository of trained machine learning models.

Artificial Intelligence / Machine Learning

Articles

Awesome Repositories

Blogs

Topic

Youtube Channel

  • NeuralNine : Educational brand focusing on programming, machine learning and computer science
  • sentdex : Funny guy who teach you about build cool stuff with python like AI
  • MLOps.community : The MLOps Community fills the swiftly growing need to share real-world Machine Learning Operations best practices from engineers in the field

Data Analysis

Articles

Awesome Repositories

Blogs

Topic

MLOps

Articles

Awesome Repositories

Blogs

Topics

AI/ML/Data/MLOps Tools

Computer Vision

Data Orchestration Workflow

  • airbyte: The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
  • airflow: A platform to programmatically author, schedule, and monitor workflows

Labeling and Annotation

  • Argilla: a collaboration tool for AI engineers and domain experts to build high-quality datasets

LLM

  • langfuse: πŸͺ’ Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets
  • litellm: Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format
  • TaskingAI: The open source platform for AI-native application development.

Models

  • trl: Train transformer language models with reinforcement learning. Hugging Face

Model Management and Serving

  • KServe: A standardΒ Model Inference PlatformΒ onΒ Kubernetes, built forΒ highly scalableΒ use cases.
  • mlflow: Open source platform for the machine learning lifecycle
  • MLServer: An open source inference server for your machine learning models.
  • ray: an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.

Streaming / CDC

  • bytewax: Python Stream Processing
  • debezium: Change data capture for a variety of databases

Toolkits

  • openvino: OpenVINOβ„’ is an open-source toolkit for optimizing and deploying AI inference
  • pachyderm: Data-Centric Pipelines and Data Versioning

Training

  • open_flamingo: An open-source framework for training large multimodal models.

VectorDB

  • Chroma: The AI-native open-source vector database (Opensource)
  • Milvus: A high-performance, highly scalable vector database that runs efficiently across a wide range of environments, from a laptop to large-scale distributed systems