Computer Vision Annotation Tool (CVAT)

CVAT is a free, online, interactive video and image annotation
tool for computer vision. Our team uses it to annotate millions
of objects with different properties. Many UI and UX decisions
are based on feedback from a professional data annotation team.
Try it online at cvat.org.

Documentation

Screencasts

Supported annotation formats

You can select a format after clicking the Upload annotation or Dump
annotation button. The Datumaro dataset framework allows additional
dataset transformations via its command line tool and Python library.

For more information about supported formats, see the
documentation.

Annotation format	Import	Export
CVAT for images	X	X
CVAT for a video	X	X
Datumaro		X
PASCAL VOC	X	X
Segmentation masks from PASCAL VOC	X	X
YOLO	X	X
MS COCO Object Detection	X	X
TFrecord	X	X
MOT	X	X
LabelMe 3.0	X	X
ImageNet	X	X
CamVid	X	X
WIDER Face	X	X
VGGFace2	X	X
Market-1501	X	X
ICDAR13/15	X	X
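
As an illustration of how two of the formats above differ, here is a minimal sketch that converts one YOLO annotation line (class id followed by a normalized box center and size) into the corner-style pixel coordinates that PASCAL VOC uses. It is not part of CVAT or Datumaro; the names `PixelBox` and `yolo_line_to_pixel_box` are hypothetical and exist only for this example.

```python
from typing import NamedTuple


class PixelBox(NamedTuple):
    """Corner-style bounding box in pixels (hypothetical helper type)."""
    label_id: int
    xmin: float
    ymin: float
    xmax: float
    ymax: float


def yolo_line_to_pixel_box(line: str, image_width: int, image_height: int) -> PixelBox:
    """Convert one YOLO annotation line to a corner-style box in pixels."""
    fields = line.split()
    if len(fields) != 5:
        raise ValueError(f"expected 5 fields, got {len(fields)}: {line!r}")

    label_id = int(fields[0])
    x_center, y_center, width, height = (float(value) for value in fields[1:])

    # YOLO stores the box center and size as fractions of the image size,
    # so scale by the image dimensions to get pixel coordinates.
    xmin = (x_center - width / 2) * image_width
    xmax = (x_center + width / 2) * image_width
    ymin = (y_center - height / 2) * image_height
    ymax = (y_center + height / 2) * image_height
    return PixelBox(label_id, xmin, ymin, xmax, ymax)


box = yolo_line_to_pixel_box("0 0.5 0.5 0.25 0.5", image_width=640, image_height=480)
print(box)
```

For real datasets, the import and export options listed in the table (or Datumaro) are the better route; this sketch only shows what the coordinate conventions mean.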

Deep learning serverless functions for automatic labeling

Name	Type	Framework	CPU	GPU
Deep Extreme Cut	interactor	OpenVINO	X
Faster RCNN	detector	OpenVINO	X
Mask RCNN	detector	OpenVINO	X
YOLO v3	detector	OpenVINO	X
Object reidentification	reid	OpenVINO	X
Semantic segmentation for ADAS	detector	OpenVINO	X
Text detection v4	detector	OpenVINO	X
SiamMask	tracker	PyTorch	X
f-BRS	interactor	PyTorch	X
Inside-Outside Guidance	interactor	PyTorch	X
Faster RCNN	detector	TensorFlow	X	X
Mask RCNN	detector	TensorFlow	X	X
RetinaNet	detector	PyTorch	X	X

Online demo: cvat.org

This is an online demo with the latest version of the annotation tool.
Try it online without local installation. Users can see only their
own tasks and tasks assigned to them.

Disabled features:

Analytics: management and monitoring of the data annotation team

Limitations:

No more than 10 tasks per user
Uploaded data is limited to 500 MB
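
Before uploading to the demo server, it can help to check a local dataset against these limits. The sketch below is one way to do that under stated assumptions: the size limit is read as 500 × 1024 × 1024 bytes, which the text above does not confirm, and the helper names are hypothetical rather than part of CVAT.

```python
import os
from typing import Iterable

# Assumption: the demo's size limit is interpreted as 500 MiB.
MAX_UPLOAD_BYTES = 500 * 1024 * 1024
# From the limitations above: no more than 10 tasks per user.
MAX_TASKS_PER_USER = 10


def total_size_bytes(paths: Iterable[str]) -> int:
    """Sum the on-disk size of the given files."""
    return sum(os.path.getsize(path) for path in paths)


def fits_demo_limits(upload_bytes: int, existing_tasks: int) -> bool:
    """Return True if one more task with this much data stays within the limits."""
    return upload_bytes <= MAX_UPLOAD_BYTES and existing_tasks < MAX_TASKS_PER_USER
```

For example, `fits_demo_limits(total_size_bytes(files), existing_tasks=3)` tells you whether a fourth task built from `files` would fit, where `files` is your own list of paths.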

Prebuilt Docker images

Prebuilt Docker images for CVAT releases are available on Docker Hub.

LICENSE

Code released under the MIT License.

This software uses LGPL licensed libraries from the FFmpeg project.
The exact steps on how FFmpeg was configured and compiled can be found in the Dockerfile.

FFmpeg is an open source framework licensed under LGPL and GPL.
See https://www.ffmpeg.org/legal.html. You are solely responsible
for determining if your use of FFmpeg requires any
additional licenses. Intel is not responsible for obtaining any
such licenses, nor liable for any licensing fees due in
connection with your use of FFmpeg.

Questions

Questions about using CVAT or about unclear concepts can be posted in our
Gitter chat for quick replies from
contributors and other users.

However, if you have a feature request or a bug report that can be reproduced,
feel free to open an issue (with steps to reproduce the bug if it's a bug
report) on GitHub* issues.

If you are not sure or just want to browse other users' common questions,
Gitter chat is the way to go.

Other ways to ask questions and get our support:

#cvat tag on StackOverflow*
Forum on Intel Developer Zone

Projects using CVAT

Onepanel is an open source
vision AI platform that fully integrates CVAT with scalable data processing
and parallelized training pipelines.
DataIsKey uses CVAT as its primary data labeling tool
to offer annotation services for projects of any size.
Human Protocol uses CVAT to add an annotation service to the Human Protocol.

Name and owner	cvat-ai/cvat
Primary language	Python
Languages	Python (number of languages: 11)
Platforms	Docker, Linux, Mac, Web browsers
License	MIT License

Created	2018-06-29 22:02:45
Last push	2025-11-05 00:38:55
Last commit	2025-10-29 20:00:16
Releases	126
Latest release	v2.48.1
First release	0.1.0 (released 2018-06-30 04:28:29)

Stars	14.7k
Watchers	176
Forks	3.4k
Commits	5.6k
Issues	4652
Open issues	537
Pull requests	4287
Open pull requests	48
Closed pull requests	687
