CVAT

强大而高效的计算机视觉标注工具(CVAT)。「Powerful and efficient Computer Vision Annotion Tool (CVAT)」

Github星跟踪图

Computer Vision Annotation Tool (CVAT)

CI
Gitter chat
Coverage Status
server pulls
ui pulls
DOI

CVAT is free, online, interactive video and image annotation
tool for computer vision. It is being used by our team to
annotate million of objects with different properties. Many UI
and UX decisions are based on feedbacks from professional data
annotation team. Try it online cvat.org.

CVAT screenshot

Documentation

Screencasts

Supported annotation formats

Format selection is possible after clicking on the Upload annotation and Dump
annotation buttons. Datumaro
dataset framework allows additional dataset transformations via its command
line tool and Python library.

For more information about supported formats look at the
documentation.

Annotation format Import Export
CVAT for images X X
CVAT for a video X X
Datumaro X
PASCAL VOC X X
Segmentation masks from PASCAL VOC X X
YOLO X X
MS COCO Object Detection X X
TFrecord X X
MOT X X
LabelMe 3.0 X X
ImageNet X X
CamVid X X
WIDER Face X X
VGGFace2 X X
Market-1501 X X
ICDAR13/15 X X

Deep learning serverless functions for automatic labeling

Name Type Framework CPU GPU
Deep Extreme Cut interactor OpenVINO X
Faster RCNN detector OpenVINO X
Mask RCNN detector OpenVINO X
YOLO v3 detector OpenVINO X
Object reidentification reid OpenVINO X
Semantic segmentation for ADAS detector OpenVINO X
Text detection v4 detector OpenVINO X
SiamMask tracker PyTorch X
f-BRS interactor PyTorch X
Inside-Outside Guidance interactor PyTorch X
Faster RCNN detector TensorFlow X X
Mask RCNN detector TensorFlow X X
RetinaNet detector PyTorch X X

Online demo: cvat.org

This is an online demo with the latest version of the annotation tool.
Try it online without local installation. Only own or assigned tasks
are visible to users.

Disabled features:

Limitations:

  • No more than 10 tasks per user
  • Uploaded data is limited to 500Mb

Prebuilt Docker images

Prebuilt docker images for CVAT releases are available on Docker Hub:

LICENSE

Code released under the MIT License.

This software uses LGPL licensed libraries from the FFmpeg project.
The exact steps on how FFmpeg was configured and compiled can be found in the Dockerfile.

FFmpeg is an open source framework licensed under LGPL and GPL.
See https://www.ffmpeg.org/legal.html. You are solely responsible
for determining if your use of FFmpeg requires any
additional licenses. Intel is not responsible for obtaining any
such licenses, nor liable for any licensing fees due in
connection with your use of FFmpeg.

Questions

CVAT usage related questions or unclear concepts can be posted in our
Gitter chat for quick replies from
contributors and other users.

However, if you have a feature request or a bug report that can reproduced,
feel free to open an issue (with steps to reproduce the bug if it's a bug
report) on GitHub* issues.

If you are not sure or just want to browse other users common questions,
Gitter chat is the way to go.

Other ways to ask questions and get our support:

Projects using CVAT

  • Onepanel is an open source
    vision AI platform that fully integrates CVAT with scalable data processing
    and parallelized training pipelines.
  • DataIsKey uses CVAT as their prime data labeling tool
    to offer annotation services for projects of any size.
  • Human Protocol uses CVAT as a way of adding annotation service to the human protocol.

主要指标

概览
名称与所有者cvat-ai/cvat
主编程语言Python
编程语言Python (语言数: 11)
平台Docker, Linux, Mac, Web browsers
许可证MIT License
所有者活动
创建于2018-06-29 14:02:45
推送于2025-04-22 14:58:14
最后一次提交2025-04-17 16:43:42
发布数106
最新版本名称v2.34.0 (发布于 )
第一版名称0.1.0 (发布于 2018-06-29 23:28:29)
用户参与
星数13.6k
关注者数183
派生数3.2k
提交数5.1k
已启用问题?
问题数4477
打开的问题数522
拉请求数3925
打开的拉请求数47
关闭的拉请求数642
项目设置
已启用Wiki?
已存档?
是复刻?
已锁定?
是镜像?
是私有?