update zsod-data.ts #443

leandstvh · 2024-01-17T20:03:04Z

Added details to the data.ts file for zero-shot object detection.

merveenoyan

Thanks a lot for this! BTW, @GuichardVictor is contributing the same page in PR #435. Maybe you could align with those changes and contribute on top of them?

merveenoyan · 2024-01-18T20:46:40Z

packages/tasks/src/tasks/zero-shot-object-detection/about.md

@@ -0,0 +1,15 @@
+## Use Cases


I think if you're not contributing content to this page, you can remove the file itself, since the placeholder says pretty much the same thing.

merveenoyan · 2024-01-18T20:47:21Z

packages/tasks/src/tasks/zero-shot-object-detection/data.ts

+			id: "imagenet-1k"
+		}, 
+		{
+			description: "Microsoft COCO, a large-scale dataset of object instance segmentation, aims to advance object recognition by placing it in the context of scene understanding. It comprises 328k images with 2.5 million labeled instances of 91 common objects easily recognizable by a 4-year-old. ",


ImageNet-1k is enough for now; we can remove COCO.

merveenoyan · 2024-01-18T20:48:26Z

packages/tasks/src/tasks/zero-shot-object-detection/data.ts

+	demo: {
+		inputs: [
+			{
+				filename: "zero-sh-obj-detection_1.png"


Can you upload this image by opening a PR here? https://huggingface.co/datasets/huggingfacejs/tasks

Also, it would be nice to stick to the general convention for naming files (you can take a look at other task pages for examples).

merveenoyan · 2024-01-18T20:49:48Z

packages/tasks/src/tasks/zero-shot-object-detection/data.ts

+				type: "chart", 
+				data: [
+					{
+						label: "human face", 


Zero-shot object detection doesn't return a chart; it directly returns a class probability with a bounding box. You can take a look at the example task page https://huggingface.co/tasks/object-detection (and its files in this repository).
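
For illustration only, here is a minimal sketch of what a detection-style result looks like: each candidate label comes with a score and a bounding box, rather than a chart of label scores. The `BoundingBox` and `Detection` types, the `filterDetections` helper, and all sample values are hypothetical and are not taken from this repository.

```typescript
// Hypothetical types for illustration; not the repository's actual definitions.
interface BoundingBox {
	xmin: number;
	ymin: number;
	xmax: number;
	ymax: number;
}

interface Detection {
	label: string; // candidate label supplied as a text query
	score: number; // confidence in [0, 1]
	box: BoundingBox; // pixel coordinates in the input image
}

// Hypothetical helper: keep only detections at or above a confidence threshold.
function filterDetections(detections: Detection[], threshold: number): Detection[] {
	return detections.filter((d) => d.score >= threshold);
}

// Made-up sample values.
const detections: Detection[] = [
	{ label: "human face", score: 0.92, box: { xmin: 10, ymin: 20, xmax: 110, ymax: 140 } },
	{ label: "rocket", score: 0.08, box: { xmin: 200, ymin: 30, xmax: 260, ymax: 180 } },
];

const kept = filterDetections(detections, 0.5);
```

The point of the sketch is the shape of each entry: a label, a score, and a box per detected object, which a chart-type output cannot express.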

merveenoyan · 2024-01-18T20:50:52Z

packages/tasks/src/tasks/zero-shot-object-detection/data.ts

+	],
+	models: [	
+		{
+			description: "OWL-ViT is a zero-shot text-conditioned object detection model that uses CLIP as its multi-modal backbone to detect objects in images based on textual descriptions. It can handle multiple text queries per image and has been trained on standard detection datasets using a bipartite matching loss."


We tend to keep the descriptions shorter.

merveenoyan · 2024-02-21T17:27:00Z

@leandstvh can you resolve the merge conflicts?

update zsod-data.ts

860a7b8

leandstvh requested review from osanseviero, SBrandeis, gary149 and Wauplin as code owners January 17, 2024 20:03

leandstvh closed this Jan 17, 2024

leandstvh reopened this Jan 17, 2024

osanseviero requested review from merveenoyan and removed request for gary149, osanseviero, Wauplin and SBrandeis January 17, 2024 20:57

merveenoyan reviewed Jan 18, 2024

coyotte508 force-pushed the main branch from 7c653d5 to 0f29277 Compare February 6, 2024 14:41
