Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Localization request for ful_Adlm #4390

Open
1 of 3 tasks
mdoumbouya opened this issue Mar 8, 2024 · 2 comments
Open
1 of 3 tasks

Localization request for ful_Adlm #4390

mdoumbouya opened this issue Mar 8, 2024 · 2 comments
Assignees
Labels
Localisation New language requests and or issues regarding localisation (l10n)

Comments

@mdoumbouya
Copy link

mdoumbouya commented Mar 8, 2024

Welcome to the Common Voice Community !

Common Voice aims to make speech technology accessible to everyone by building an open sourced dataset of labeled voice data that is representative of languages, variants and accents spoken across the world. This template helps us to know how your language could participate in the Common Voice Project. There are three sections of this form, once you have filled out a section please click the checkbox. If you have any issues please contact commonvoice@mozilla.org.

Pontoon Set-up

To start a language on Common Voice volunteers localize our platform via Pontoon and create sentence corpus’ of cc0 text.

Language name
Fulah

Language code

Language size

50 to 80 million people

See Also:

Plural forms

eng_Latn ful_Adlm
0 rocks 𞥐 𞤨𞤫𞤼𞤫
1 rock 𞥑 𞤬𞤫𞤼𞤫𞤪𞤫
2 rocks 𞥒 𞤨𞤫𞤼𞤫
3 rocks 𞥓 𞤨𞤫𞤼𞤫
4 rocks 𞥔 𞤨𞤫𞤼𞤫
5 rocks 𞥕 𞤨𞤫𞤼𞤫
10 rocks 𞥑𞥐 𞤨𞤫𞤼𞤫
20 rocks 𞥒𞥐 𞤨𞤫𞤼𞤫
100 rocks 𞥑𞥐𞥐 𞤨𞤫𞤼𞤫
1000 rocks 𞥑𞥐𞥐𞥐 𞤨𞤫𞤼𞤫
I see 0 rocks on the ground 𞤃𞤭 𞤴𞤭𞤴𞤭 𞥐 𞤨𞤫𞤼𞤫 𞤳𞤢 𞤸𞤮𞥅𞤪𞤫 𞤤𞤫𞤴𞤣𞤭
I see 1 rock on the ground 𞤃𞤭 𞤴𞤭𞤴𞤭 𞥑 𞤬𞤫𞤼𞤫𞤪𞤫 𞤳𞤢 𞤸𞤮𞥅𞤪𞤫 𞤤𞤫𞤴𞤣𞤭
I see 10 rocks on the ground 𞤃𞤭 𞤴𞤭𞤴𞤭 𞥑𞥐 𞤨𞤫𞤼𞤫 𞤳𞤢 𞤸𞤮𞥅𞤪𞤫 𞤤𞤫𞤴𞤣𞤭
I see rocks on the ground 𞤃𞤭 𞤴𞤭𞤴𞤭 𞤨𞤫𞤼𞤫 𞤳𞤢 𞤸𞤮𞥅𞤪𞤫 𞤤𞤫𞤴𞤣𞤭

Pontoon manager
doumbouya.moussa@gmail.com
ibrahimbaben@outlook.com

Language Script

  • Script Name: Adlam
  • Script code (ISO 15924): Adlm

Sentence Collection Requirements

On the Common Voice Platform contributors on the platform read out public domain sentences generated through sentence collection. Sentence collection is a crucial part in launching languages on Common Voice. To support the equitable participation of languages of Common Voice we have introduced three new sentence collection requirements bands.

Sentence Requirement Band

  • Band A
  • Band B
  • Band C

Creating Community

Community Link
WhatsApp ADLaM Common Voice https://chat.whatsapp.com/BJkSDymkvMzGkdP0m7vQfZ
Telegram ADLaM Common Voice https://t.me/+wpEuoG9xUmc5ZjJk

Additional Optional Info

  • Why do you want to take part in Common Voice?

I would like to participate in Common Voice because I strongly believe in the significance of contributing to open-source projects dedicated to advancing speech recognition technology. By sharing my voice data in Fulah, I aim to contribute to the improvement of inclusivity and accuracy in voice recognition systems specifically tailored for speakers of my language. Engaging in Common Voice is in line with my dedication to promoting technological progress that benefits diverse linguistic communities. Furthermore, this participation can empower more individuals in my community to send text messages, ultimately making voice translation more accessible and convenient for my community. This endeavor will also help us preserve knowledge, including tales, proverbs, riddles, legends, etc., held by people who cannot read or write.

  • Would you like to have a follow up conversation regarding community building?

Yes, I would be interested in having a follow-up conversation regarding community building for the Fula language. Building a strong and supportive community is crucial for the success of projects like Common Voice, and I am eager to discuss how we can further contribute to the development and growth of our linguistic community. Please provide more details or suggest a time for the conversation, and I'll be happy to participate.

@ftyers
Copy link
Collaborator

ftyers commented Mar 28, 2024

Dear @mdoumbouya, Fulah ff is already set up in Pontoon with Common Voice enabled for translation. You can find the link here. It appears that it has been set up in Latin script rather than in Adlam script. I believe that Fulah is written in both scripts.

We can't change the script without discussion with the community (you can check the Contributors link to see who has been involved in translating so far) and get in contact with them.

However, we are working on multi-orthography support for Common Voice and the Fulah community could be a participant in the process, please take a look at this Discourse post and get in contact if you are interested.

CC @ginamoape

@ftyers ftyers added the Localisation New language requests and or issues regarding localisation (l10n) label Mar 28, 2024
@mdoumbouya
Copy link
Author

Dear @ftyers:

Yes, Fulah has been written in several scripts, including Latin, Ajami (Arabic), and Adlam.

We are not requesting to change the script. Rather we are asking to add support for Adlam.

In previous work where we had to work with different scripts for the same language, we used {ISO-15924}_{ISO-639-3} (languageCode_scriptCode) as a unique identifier: e.g. (ful_Adlm, ful_Latn, bam_Latn, bam_Nko). Could something similar work on Common Voice?

Yes, we are interested in piloting this capability with you. We have submitted the form referenced in the discourse post.

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Localisation New language requests and or issues regarding localisation (l10n)
Projects
None yet
Development

No branches or pull requests

2 participants