Machine Learning

Articles

Awesome Repositories

Blogs

Landscape

Organization

Page

  • AI Agents Directory: Browse our AI agents list and build your digital workforce in minutes, not months 🌟 (Recommended)
  • AI Tools Directory: Access the largest list of top-quality AI tools available on the web 🌟 (Recommended)
  • DeepWiki: AI Documentation for any repository 🌟 (Recommended)
  • Feature Stores for ML: Collection pages about Feature stores for ML
  • Hugging Face : The Opensource AI community 🌟 (Recommended)
  • Kaggle: Your Machine Learning and Data Science Community 🌟 (Recommended)
  • LF AI & DATA Projects: Linux Foundation project about Open Source Innovation in Artificial Intelligence and Data 🌟 (Recommended)
  • Made With ML: Learning how to responsibly deliver value with ML!
  • NGC Catalog: NGC Catalog - GPU Accelerated AI models and SDKs that help you infuse AI into your applications at speed of light (NVIDIA)
  • Papers With Code: The latest in Machine Learning 🌟 (Recommended)
  • TensorFlow Hub: A repository of trained machine learning models.
  • There’s An AI For That: Update the new about tools and technologies about AI

Papers

Topic

Youtube Channel

  • NeuralNine : Educational brand focusing on programming, machine learning and computer science
  • sentdex : Funny guy who teach you about build cool stuff with python like AI
  • MLOps.community : The MLOps Community fills the swiftly growing need to share real-world Machine Learning Operations best practices from engineers in the field 🌟 (Recommended)

Data Engineer

center

Articles

Awesome Repositories

Blogs

Organization

  • Big Data Europe: Integrating Big Data, software & communicaties for addressing Europe’s societal challenge
  • DataTalksClub: The place to talk about data

Landscape

Topic

Youtube

MLOps

Articles

Awesome Repositories

Blogs

Code Example / Crash Course

Topics

AI/ML/Data/MLOps Tools

Big Data

  • Spark: A multi-language engine for executing data engineering, data science, and machine learning on single-node machines or clusters.

Computer Vision

  • opencv: Open Source Computer Vision Library 🌟 (Recommended)

Data Orchestration Workflow

  • airbyte: The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted. 🌟 (Recommended)

  • airflow: A platform to programmatically author, schedule, and monitor workflows 🌟 (Recommended)

    • Astronomer Registry: Building Blocks for your Apache Airflow Data Pipelines.
    • Astro: A fully-managed SaaS application for data orchestration that helps teams write and run data pipelines with Apache Airflow
  • kestra: ⚑ Workflow Automation Platform

  • prefect: A workflow orchestration framework for building resilient data pipelines in Python.

Labeling and Annotation

  • Argilla: a collaboration tool for AI engineers and domain experts to build high-quality datasets 🌟 (Recommended)

UI Builder

  • chainlit: Build Conversational AI in minutes ⚑️ 🌟 (Recommended)
  • streamlit: A faster way to build and share data apps 🌟 (Recommended)

LLM

  • langfuse: Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets 🌟 (Recommended)
  • litellm: Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format 🌟 (Recommended)
  • TaskingAI: A BaaS (Backend as a Service) platform forΒ LLM-based Agent Development and Deployment

MLOps

  • KitOps: An open source DevOps tool for packaging and versioning AI/ML models, datasets, code, and configuration into an OCI artifact.
  • polyaxon: MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle

Model Management and Serving

  • KServe: A standardΒ Model Inference PlatformΒ onΒ Kubernetes, built forΒ highly scalableΒ use cases.
  • MLFlow: Open source platform for the machine learning lifecycle 🌟 (Recommended)
  • MLServer: An open source inference server for your machine learning models.
  • openvino: OpenVINOβ„’ is an open-source toolkit for optimizing and deploying AI inference
  • Ray: an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads. 🌟 (Recommended)

VectorDB

  • Chroma: The AI-native open-source vector database (Opensource) 🌟 (Recommended)
  • Milvus: A high-performance, highly scalable vector database that runs efficiently across a wide range of environments, from a laptop to large-scale distributed systems 🌟 (Recommended)