Common Voice Project
#KelasMozilla
Catatan: gunakan tombol spasi untuk ke halaman berikutnya :)
Intro~
Common VOice Project?
Catatan: gunakan tombol spasi untuk ke halaman berikutnya :)
Proyek Common Voice adalah inisiatif Mozilla untuk membuat perangkat yang mengajari mesin bagaimana manusia sebenarnya berbicara menjadi lebih terbuka, mudah diakses dan inklusif.
COntribution Area
CV sentence collector
It allows contributors to collect and validate sentences created by the community. You can use this tool also to import and clean up small-to-medium-sized public domain corpus you have found or collected
Voice donation & voice review
crowdsourcing open-source datasets of voices. Donate your voice, validate the accuracy of other people's clips, make the dataset better for everyone.
Common voice ID
1. Donate your voice by reading the sentence clearly
2. Validate the accuracy of donated clips, checking if the speaker read the sentence correctly
Catatan: gunakan tombol spasi untuk ke halaman berikutnya :)
How to Contribute?
1. Open https://commonvoice.mozilla.org/id do sign up/login so your donation will keep in tracked :)
2. Sta
- Sumber kalimat harus bersumber terbuka Public Domain (CC-0) license.
- Angka. There should be no digits in the source text because they can cause problems when read aloud. The way a number is read depends on context and might introduce confusion in the dataset. For example, the number “2409” could be accurately read as both “twenty-four zero nine” and “two thousand four hundred nine”.
- Singkatan dan akronim. Abbreviations and acronyms like “USA” or “ICE” should be avoided in the source text because they may be read in a way that does not coincide with their spelling. Additionally, there may be multiple accurate readings for a single abbreviation. For example, the acronym “ICE” could be pronounced “I-C-E” or as a single word.
- Tanda baca. Special symbols and punctuation should only be included when absolutely necessary. For example, an apostrophe is included in English words like “don’t” and “we’re” and should be included in the source text, but it’s unlikely you’ll ever need a special symbol like “@” or “#.”
- Huruf asing. Letters must be valid in the language being spoken. For example, “ж” is a letter in the Russian alphabet but is never used in English and so should never appear in any English source text.
- Panjang kalimat. Panjang kalimat harus kurang dari 14 kata.
HOW TO
-
Menambahkan kalimat baru
HOW TO
-
Peninjauan Kalimat (Review sentences)
- Kalimat tersebut harus memiliki ejaan yang benar
- Kalimat tersebut memiliki makna gramatikal yang benar (sesuai konteks)
-
Kalimat harus dapat dibaca
- Jika kalimat memenuhi 3 kriteria di atas, klik tombol "yes" di sebelah kanan
- Jika kalimat tidak memenuhi salah satu dari tiga kriteria di atas, klik tombol "no". Jika tidak yakin, kalimat tersebut bisa dilewati dan lanjut ke kalimat berikutnya
- Jika semua kalimat sudah selesai kamu tinjau, bantu kumpulkan lebih banyak kalimat
-
Referensi dalam meninjau kalimat
- https://kbbi.kemdikbud.go.id/
ありがとう
Terima kasih
Munches gracies
ꦩꦠꦸꦂꦤꦸꦮꦸꦤ꧀
Common Voice-ID
By lidyaa
Common Voice-ID
- 242