Spec for audio to audio #502

Wauplin · 2024-02-23T10:44:26Z

Json schema for audio-to-audio task. This is the "expected type" in the Python client but would prefer to double-check it's correct before continuing. Ping @Vaibhavs10 @osanseviero could you have a look please?

(useful for huggingface/huggingface_hub#2036)

coyotte508 · 2024-02-23T10:57:56Z

Should generate TS type with the helper command (pnpm run build for ex)

Wauplin · 2024-02-23T10:59:56Z

Should generate TS type with the helper command

Done in f80bf58.

SBrandeis

nice

packages/tasks/src/tasks/audio-to-audio/spec/input.json

SBrandeis · 2024-02-23T13:34:33Z

packages/tasks/src/tasks/audio-to-audio/spec/output.json

Is the pipeline generating several audios at once?
If not I would recommend not wrapping the output in an array for consistency with other tasks (see ImageToText for example)
https://github.com/huggingface/huggingface.js/blob/af447753043472d8ba65ab0591be7487433f7261/packages/tasks/src/tasks/image-to-text/spec/output.json#L1-L15

Yes it does. You can try on the widget in https://huggingface.co/tasks/audio-to-audio.

SBrandeis · 2024-02-23T13:35:20Z

packages/tasks/src/tasks/audio-to-audio/spec/output.json

+				"type": "string",
+				"description": "The label of the audio file."
+			},
+			"content-type": {


Maybe?
Can you link to where those parameters are documented please?

Suggested change

"content-type": {

"content_type": {

It's not documented AFAIK. I only know it from the call made by the widget on https://huggingface.co/tasks/audio-to-audio 😕

For content-type vs content_type I've added a normalizer rule in Python to accept both since the Python attribute can only be content_type (and is generated like this).

You can see some specification from https://github.com/huggingface/api-inference-community/blob/main/docker_images/speechbrain/app/pipelines/audio_to_audio.py and potential outputs

osanseviero · 2024-03-26T10:34:14Z

packages/tasks/src/tasks/audio-to-audio/inference.ts

+ *
+ * A generated audio file with its label.
+ */
+export interface AudioToAudioOutputElement {


FYI here is the returned values from community API https://github.com/huggingface/api-inference-community/blob/main/docker_images/speechbrain/app/pipelines/audio_to_audio.py#L37-L44

Wauplin added 2 commits February 23, 2024 11:27

Add spec

2ff2bc6

new spec

6e08861

Wauplin requested review from osanseviero, SBrandeis and gary149 as code owners February 23, 2024 10:44

generated-file

f80bf58

SBrandeis reviewed Feb 23, 2024

View reviewed changes

osanseviero requested a review from Vaibhavs10 February 23, 2024 13:38

Wauplin added 2 commits February 23, 2024 15:13

add parameters

bd7a46b

Merge branch 'main' into spec-for-audio-to-audio

7b98731

Wauplin requested a review from julien-c as a code owner March 12, 2024 11:14

osanseviero reviewed Mar 26, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Spec for audio to audio #502

Spec for audio to audio #502

Wauplin commented Feb 23, 2024

coyotte508 commented Feb 23, 2024

Wauplin commented Feb 23, 2024

SBrandeis left a comment

SBrandeis Feb 23, 2024

Wauplin Feb 23, 2024

SBrandeis Feb 23, 2024

Wauplin Feb 23, 2024

Wauplin Feb 23, 2024

osanseviero Mar 26, 2024

osanseviero Mar 26, 2024

Spec for audio to audio #502

Are you sure you want to change the base?

Spec for audio to audio #502

Conversation

Wauplin commented Feb 23, 2024

coyotte508 commented Feb 23, 2024

Wauplin commented Feb 23, 2024

SBrandeis left a comment

Choose a reason for hiding this comment

SBrandeis Feb 23, 2024

Choose a reason for hiding this comment

Wauplin Feb 23, 2024

Choose a reason for hiding this comment

SBrandeis Feb 23, 2024

Choose a reason for hiding this comment

Wauplin Feb 23, 2024

Choose a reason for hiding this comment

Wauplin Feb 23, 2024

Choose a reason for hiding this comment

osanseviero Mar 26, 2024

Choose a reason for hiding this comment

osanseviero Mar 26, 2024

Choose a reason for hiding this comment