Update: We have published these activities with a better format over Mozilla Activate portal.
Hello everyone,
Iāve been working on series of activity bundles for Common Voice that I would like to share with you all for feedback.
This post is going to be long, so here you have my asks:
- Test the activities by yourself first and then come back here to add your feedback.
- What worked, what didnāt work?
- Are we missing anything?
- Is there another scenario you think we can design an activity? Please describe it.
My goal is that we can iterate and improve these activities during March by testing them on the ground so by the beginning of April we can post them as official activity recommendations and actively promote and encourage people to do them.
Purpose
In order for a dataset to be useful, it needs at least 2000 hours of validated voice recorded by at least 1000 different voices. Voice diversity is key: gender, age, background noise, accentsā¦
Guiding principles
We believe we can get this voice diversity from communities participation, and we would like to see around 10% of the total goal (around 200 hours) in a few locales by the end of June.
Our calculations tell us that we can get there by engaging 800 people in each language, each of them donating 15 minutes of voice (225 clips) and helping review 450 clips. In order for this to work we need at least 180 000 sentences available on the site. These numbers are optimized to avoid contributor fatigue, we assume some people will donate more and some less.
We also understand that for these donations to be engaging we should design for it, having in mind different scenarios where these donations can take place with different audiences and time availability.
Also, we want to make sure people have a basic understanding on the importance of this project to the world, userās privacy, open innovation and the value it can unlock to everyone. This will increase engagement.
Participants can also unleash a viral effect to get more people involved and promote this project, word-of-mouth and social media.
Getting people to create an account on the site and subscribe to the newsletter is something we really want, as a way to be able to follow-up with them later.
Audiences
- Individual contributors: People interested in contributing from home on their own, at their own pace.
- Small groups: Close friends or family groups that want to see in action what this project is about.
- Medium-large groups: Students, event attendees.
Proposed bundles
Lone cowboy/cowgirl
- Audience: Individual contribution
- Time needed: From few minutes to 1 hour (total)
- Outcomes: 225 clips recorded, 450 clips reviewed, optional social sharing, follow-up to next bundle, subscribe to the site.
- Material: Computer or smartphone, microphone
Visit https://voice.mozilla.org/ and create an account, make sure you subscribe to newsletter updates.
- Click on Speak
- Check the sentence on the screen
- Click on the red microphone icon
- Read the sentence out loud
- Click the red stop icon
- Repeat until you have recorded at least 5 sentences
- Click send and repeat
Now itās a good moment to share this if you like on your social media:
Help teach machines how real people speak, Iāve just donated my voice at https://voice.mozilla.org #commonvoice
Donating 15 minutes of your voice is enough, this is more less 225 clips. Once you have reached this number itās more useful if you to to the Listen section and help review other peopleās voices.
- Click on Listen
- Click Play
- Listen to the voice (try not to read the screen)
- Now read the screen and compare with what you listened
- If itās the exact same click yes, if not click no (note different accents are fine)
- Repeat
Ideally each person should reach at least 450 clip reviews, feel free to devote a few minutes each day.
Once (or as you are getting there) you have recorded 225 clips and reviewed at least 450, itās probably time for you to go to the next challenge, check the āFun with friendsā activity.
Fun with friends
- Audience: Small groups (3 to 10)
- Time needed: 30 minutes
- Outcomes: 15 min recording, 15 minutes reviewing, optional social sharing, follow-up with 3-5 people each, subscribe to the site.
- Material: One smartphone per person, ideally headphones with microphone
Gather together 3 to 10 friends or family. Briefly talk with them the concerns of the current voice-recognition systems and the privacy implications. Getting a few minutes so everyone can talk about their reactions and concerns is a good idea.
Intro them to the Common Voice project, why itās important, show them how easy is with just a smartphone to donate your voice and to review other peopleās voices.
Ask them to use their smartphones and drive them through how to create an account. Spend 15 minutes and have fun recording voices.
After that, spend 15 minutes reviewing other peopleās voices (enjoy other peopleās accents and voices!)
Invite them to talk about this in their social media accounts:
Help teach machines how real people speak, Iāve just donated my voice at https://voice.mozilla.org #commonvoice
Invite them that in the following days they run this same activity with 2-3 people they know and share the experience back with you.
Show time
- Audience: Medium-large (10-100)
- Time needed: 1 hour
- Outcomes: Intro to the project (10m), 15 min voice recordings, 30 minutes clip reviews, optional social sharing, follow-up each each participant to run āFun with friendsā, subscribe to the site.
- Material: Projector, poster, slides, computer, one smartphone per person, headphones with microphone highly encouraged
This activity is divided into four activities.
1. Intro to the project (10 minutes)
The activity owner will run a max. 10 minutes presentation (slides you can use) to introduce the project to the audience, the key elements that this should answer are:
- Whatās this project about?
- Why itās important?
- What are our goals?
- How can I be involved?
2. Voice donations (15 minutes)
Individuals will sign-in the site and with their smartphones and microphones they will start donating their voices for 15 minutes. Activity owner will control the time and ask people to move to the next activity.
3. Review time! (30 minutes)
People will make groups of two people and will go to the listen section of the site. One person of the groups will play the recording on his/her device and the other person will just listen (without reading the screen). The person listening will repeat what he/she has heard, if itās the same as the screen is showing the first person will click āYesā and move to the next one.
Activity owner will control time and ask for couples to change roles after the first 15 minutes.
4. Wrap-up and sharing (5 minutes)
The activity owner will ask participants to optionally share on their social media about this activity
Help teach machines how real people speak, Iāve just donated my voice at https://voice.mozilla.org #commonvoice
People will be also asked to make sure they subscribed to the Common Voice newsletter and check the āFun with friends activityā as something easy to do in the next few days.
Reach the crowd
- Audience: Medium-large (10-*000)
- Time needed: From 1h
- Outcomes: Get random people to donate 2-3 minutes of their time at one fixed place where crowds pass-by (like a booth), optional social sharing, follow-up with fun with friends, subscribe to the site.
- Material: A few laptops or tablets with microphones, ideally with headphones.
A set of at least 2 laptop/tablets will be set up in the fixed place with only Firefox in full-screen mode with https://voice.mozilla.org loaded.
Ideally a paper next to the device will read:
Machines should understand your voice without compromising your privacy.
Donate 3 minutes of your voice now to help [LANGUAGE]!
- Click on Speak
- Check the sentence on the screen
- Click on the red microphone icon
- Read the sentence out loud
- Click the red stop icon
- Repeat until you have recorded at least 5 sentences
- Click send
Ask anyone in this place if you need assistance or more information.
People in the fixed place should encourage people passing by to know about how they can help protect their privacy if they donate 2-3 minutes of their time.
Examples:
- Hello! How are you doing? Do you know you can help machines to understand you and protect your privacy?
- Hi! Did you know Common Voice wants to teach machines to understand us and protect our privacy?
- Hello there! Would you like machines to understand your voice without sending it to Google or Amazon?
After this initial approach we can quickly introduce people to the Common Voice project and invite them to donate 3 minutes of their time now to help their language be understood by machines in a privacy-aware way.