Tianheng Cheng^2,3,*, Lin Song^1,📧,*, Yixiao Ge^1,🌟,2, Wenyu Liu³, Xinggang Wang^3,📧, Ying Shan^1,2
\* Equal contribution 🌟 Project lead 📧 Corresponding author ¹ Tencent AI Lab, ² ARC Lab, Tencent PCG ³ Huazhong University of Science and Technology

[![arxiv paper](https://img.shields.io/badge/Project-Page-green)](https://wondervictor.github.io/) [![arxiv paper](https://img.shields.io/badge/arXiv-Paper-red)](https://arxiv.org/abs/2401.17270)

[![demo](https://img.shields.io/badge/🤗HugginngFace-Spaces-orange)](https://huggingface.co/spaces/stevengrove/YOLO-World) [![Replicate](https://replicate.com/zsxkib/yolo-world/badge)](https://replicate.com/zsxkib/yolo-world) [![hfpaper](https://img.shields.io/badge/🤗HugginngFace-Paper-yellow)](https://huggingface.co/papers/2401.17270) [![license](https://img.shields.io/badge/License-GPLv3.0-blue)](LICENSE) [![yoloworldseg](https://img.shields.io/badge/YOLOWorldxEfficientSAM-🤗Spaces-orange)](https://huggingface.co/spaces/SkalskiP/YOLO-World) [![yologuide](https://img.shields.io/badge/📖Notebook-roboflow-purple)](https://supervision.roboflow.com/develop/notebooks/zero-shot-object-detection-with-yolo-world) [![deploy](https://media.roboflow.com/deploy.svg)](https://inference.roboflow.com/foundation/yolo_world/)

Model	Resolution	LVIS AP	LVIS-mini	COCO
AP	AP_r	AP_c	AP_f	AP	AP_r	AP_c	AP_f	AP	AP₅₀	AP₇₅
YOLO-World-S	640	18.5^+1.2	12.6	15.8	24.1	23.6^+0.9	16.4	21.5	26.6	36.6	51.0	39.7
YOLO-World-S	1280	19.7^+0.9	13.5	16.3	26.3	25.5^+1.4	19.1	22.6	29.3	38.2	54.2	41.6
YOLO-World-M	640	24.1^+0.6	16.9	21.1	30.6	30.6^+0.6	19.7	29.0	34.1	43.0	58.6	46.7
YOLO-World-M	1280	26.0^+0.7	19.9	22.5	32.7	32.7^+1.1	24.4	30.2	36.4	43.8	60.3	47.7
YOLO-World-L	640	26.8^+0.7	19.8	23.6	33.4	33.8^+0.9	24.5	32.3	36.8	44.9	60.4	48.9
YOLO-World-L	800	28.3	22.5	24.4	35.1	35.2	27.8	32.6	38.8	47.4	63.3	51.8
YOLO-World-L	1280	28.7^+1.1	22.9	24.9	35.4	35.5^+1.2	24.4	34.0	38.8	46.0	62.5	50.0
YOLO-World-X	640	28.6^+0.2	22.0	25.6	34.9	35.8^+0.4	31.0	33.7	38.5	46.7	62.5	51.0
YOLO-World-X-1280 is coming soon.

Model

Resolution

LVIS AP

LVIS-mini

COCO

AP_r

AP_c

AP_f

AP_r

AP_c

AP_f

AP₅₀

AP₇₅

YOLO-World-S

640

18.5^+1.2

12.6

15.8

24.1

23.6^+0.9

16.4

21.5

26.6

36.6

51.0

39.7

YOLO-World-S

1280

19.7^+0.9

13.5

16.3

26.3

25.5^+1.4

19.1

22.6

29.3

38.2

54.2

41.6

YOLO-World-M

640

24.1^+0.6

16.9

21.1

30.6

30.6^+0.6

19.7

29.0

34.1

43.0

58.6

46.7

YOLO-World-M

1280

26.0^+0.7

19.9

22.5

32.7

32.7^+1.1

24.4

30.2

36.4

43.8

60.3

47.7

YOLO-World-L

640

26.8^+0.7

19.8

23.6

33.4

33.8^+0.9

24.5

32.3

36.8

44.9

60.4

48.9

YOLO-World-L

800

28.3

22.5

24.4

35.1

35.2

27.8

32.6

38.8

47.4

63.3

51.8

YOLO-World-L

1280

28.7^+1.1

22.9

24.9

35.4

35.5^+1.2

24.4

34.0

38.8

46.0

62.5

50.0

YOLO-World-X

640

28.6^+0.2

22.0

25.6

34.9

35.8^+0.4

31.0

33.7

38.5

46.7

62.5

51.0

YOLO-World-X-1280 is coming soon.

Model	Resolution	Training	Data	Model Weights
YOLO-World-S	640	PT (100e)	O365v1+GoldG+CC-LiteV2	🤗 HuggingFace
YOLO-World-S	1280	CPT (40e)	O365v1+GoldG+CC-LiteV2	🤗 HuggingFace
YOLO-World-M	640	PT (100e)	O365v1+GoldG+CC-LiteV2	🤗 HuggingFace
YOLO-World-M	1280	CPT (40e)	O365v1+GoldG+CC-LiteV2	🤗 HuggingFace
YOLO-World-L	640	PT (100e)	O365v1+GoldG+CC-LiteV2	🤗 HuggingFace
YOLO-World-L	800 / 1280	CPT (40e)	O365v1+GoldG+CC-LiteV2	🤗 HuggingFace
YOLO-World-X	640	PT (100e)	O365v1+GoldG+CC-LiteV2	🤗 HuggingFace

Model

Resolution

Training

Data

Model Weights

YOLO-World-S

640

PT (100e)

O365v1+GoldG+CC-LiteV2

🤗 HuggingFace

YOLO-World-S

1280