AI课堂行为分析系统:从计算机视觉到多模态融合的工程实践
2026/7/5 17:30:46
X-AnyLabeling是由 CVHub 开发的一款功能强大、支持多模态的AI 驱动自动标注工具,专为数据工程师和研究人员在工业级复杂任务中提供高效、精准的标注解决方案。
https://github.com/CVHub520/X-AnyLabeling
Computer Vision Annotation Tool (CVAT)
| 任务类型 | 代表模型 |
|---|---|
| 图像分类 | YOLOv5/8/11-Cls, InternImage, PULC |
| 目标检测 | YOLOv5–v12, YOLOX, YOLO-NAS, RT-DETR, D-FINE 等 |
| 实例分割 | YOLO-Seg 系列, RF-DETR-Seg, Hyper-YOLO-Seg |
| 姿态估计 | YOLOv8/11-Pose, DWPose, RTMO |
| 跟踪(MOT) | Bot-SORT, ByteTrack |
| 旋转检测 | YOLOv5/8/11-Obb |
| 深度估计 | Depth Anything(支持深度校准) |
| 通用分割 | SAM 1/2/3, SAM-HQ, MobileSAM, EdgeSAM 等 |
| OCR | PP-OCRv4/v5 |
| 视觉语言 | Qwen3-VL, Florence2, Gemini, ChatGPT |
| 开放词汇检测 | YOLO-World, Grounding DINO, YOLOE, CountGD |
| 图像抠图 | RMBG 1.4/2.0 |
# 通过 pip 安装(需 Python ≥ 3.8)pipinstallx-anylabeling或从源码构建(获取最新功能):
gitclone https://github.com/CVHub520/X-AnyLabeling.gitcdX-AnyLabeling pipinstall-r requirements.txt python main.py配套的X-AnyLabeling-Server支持 RESTful API,便于集成到现有标注平台或自动化流水线中。
@misc{X-AnyLabeling, year = {2023}, author = {Wei Wang}, publisher = {Github}, organization = {CVHub}, journal = {Github repository}, title = {Advanced Auto Labeling Solution with Added Features}, howpublished = {\url{https://github.com/CVHub520/X-AnyLabeling}} }