mediapipe

MediaPipe is a cross-platform framework for building multimodal applied machine learning pipelines

Github星跟踪图

MediaPipe

MediaPipe is a framework for building multimodal (eg. video, audio, any time series data), cross platform (i.e Android, iOS, web, edge devices) applied ML pipelines. With MediaPipe, a perception pipeline can be built as a graph of modular components, including, for instance, inference models (e.g., TensorFlow, TFLite) and media processing functions.

Real-time Face Detection

"MediaPipe has made it extremely easy to build our 3D person pose reconstruction demo app, facilitating accelerated neural network inference on device and synchronization of our result visualization with the video capture stream. Highly recommended!" - George Papandreou, CTO, Ariel AI

ML Solutions in MediaPipe

face_detection
multi-hand_tracking
hand_tracking
hair_segmentation
object_tracking

Installation

Follow these instructions.

Getting started

See mobile, desktop and Google Coral examples.

Check out some web demos (https://viz.mediapipe.dev/runner/demos/edge_detection/edge_detection.html) (https://viz.mediapipe.dev/runner/demos/face_detection/face_detection.html) (https://viz.mediapipe.dev/runner/demos/hand_tracking/hand_tracking.html)

Documentation

MediaPipe Read-the-Docs or docs.mediapipe.dev

Check out the Examples page for tutorials on how to use MediaPipe. Concepts page for basic definitions

Visualizing MediaPipe graphs

A web-based visualizer is hosted on viz.mediapipe.dev. Please also see instructions here.

Videos

Publications

Events

Community forum

  • Discuss - General community discussion around MediaPipe

Alpha Disclaimer

MediaPipe is currently in alpha for v0.6. We are still making breaking API changes and expect to get to stable API by v1.0.

Contributing

We welcome contributions. Please follow these guidelines.

We use GitHub issues for tracking requests and bugs. Please post questions to the MediaPipe Stack Overflow with a 'mediapipe' tag.

主要指标

概览
名称与所有者google-ai-edge/mediapipe
主编程语言C++
编程语言Python (语言数: 16)
平台
许可证Apache License 2.0
所有者活动
创建于2019-06-13 19:16:41
推送于2025-04-25 06:47:58
最后一次提交2025-04-24 23:40:59
发布数66
最新版本名称v0.10.23 (发布于 )
第一版名称v0.5.0 (发布于 )
用户参与
星数29.5k
关注者数518
派生数5.3k
提交数4.6k
已启用问题?
问题数5372
打开的问题数405
拉请求数201
打开的拉请求数147
关闭的拉请求数203
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?