Add multiple entries in batches
The batch upload process is currently being redesigned for the new FirstVoices. Check back later for information about the new batch process when it is ready!
What is a batch?
Members of a language team can submit multiple words or phrases in batches via a spreadsheet format (.csv), allowing many entries to be uploaded to their language site at one time. Alphabets are also uploaded in a spreadsheet format.
The spreadsheet format allows you to work offline to prepare your content, and once you are familiar with the process it can save you a lot of time.
Overview and video tutorial
To upload text-based information only (for example a word, its definition, and notes), you only need to submit a spreadsheet (.csv file) as part of your batch.
To upload media files, such as audio recordings or images, you will also need to submit the media files to FirstVoices alongside the spreadsheet. Media files submitted through the batch upload process have strict file name requirements, which you will need to review carefully.
Overall preparation steps
You will work offline to prepare a spreadsheet containing the details of all your entries, and to organize all of your media files. There are very specific requirements for how your files and data should be formatted into this spreadsheet so please carefully look over the preparation instructions.
Once you're ready, double-check the instructions, your spreadsheet, and all of your files to make sure they are in the correct format. If you are including media files, collect them into a single folder.
Then you'll submit your spreadsheet and media files to FirstVoices by email and/or via upload to Sharepoint. Once the files are submitted, inform us via email. For help getting started with this, or if you need an alternate solution due to internet speed issues, please email hello@firstvoices.com.
As long as there are no errors in your documents, the FirstVoices team will then upload your files and send you a confirmation email once it has been completed.
Attention to detail
The batch upload process requires keen attention to detail, so some users prefer to add entries individually using the online form. If you like data entry and proofreading, batch uploads can be a really efficient way of uploading entries.
How many entries in a batch?
For your first batch, we require that you submit a small batch of 10-20 entries to familiarize yourself with the process before investing large amounts of time.
Subsequent batches should be 50 entries minimum.
For words and phrases, most teams opt for 100-200 entries per batch as an efficient but still manageable size. Batches can have hundreds or even thousands of entries, but it can be hard to spot or correct errors in a larger spreadsheet.
Preparing and submitting your batch
At this time, FirstVoices accepts the following content to be uploaded in a batch:
1. Before you start
As a team, decide on what information you want to include as part of your batch upload. In general, the more information you can provide, the more useful the language site will be in the future, but the longer the batch may take to prepare.
For example:
Just the basics:
word/phrase in your language, and its translation (for words/phrases)
alphabet characters and their order (for alphabet characters)
Sample audio file for each word, phrase, or alphabet character
Categories or a grammatical part of speech
Reference to a contributor or citation from a source document
2. Review file-naming requirements
If you plan to submit media files such as audio recordings, you will need to prepare your files in a specific way so that they can be accepted by our uploader. Our batch uploader is not as flexible as the FirstVoices website, and has more restricted file name requirements.
If special characters are used in your file names, they may become garbled during the batch transfer process and fail to upload correctly. For best results, file names should only use ASCII letters (a-z, A-Z), numbers (0-9), and characters like hyphen (-), underscore (_), or space.
| | Might not upload: | Try this: |
|---|---|---|
| No diacritics or special characters! | 20200131-gṑdaŋ.wav | 20200131-go-1daN.wav (Replace special letters, including all diacritics and extra symbols, with basic Roman letters, or come up with your own conventions.) |
| No periods, ampersand, or other symbols! | 1.ha'a~martha&bill.wav | 1_ha7a_martha-bill.wav (Periods, apostrophes, and other symbols should be replaced with hyphens, underscores, or letters/numbers.) |
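If you have many media files, a short script can help you spot names that break the rules above. The following is only a rough Python sketch, not a FirstVoices tool: the function names `is_safe_filename` and `suggest_filename` are made up for this example, and the pattern assumes the allowed characters are exactly the ones listed above, plus a single period before the file type.

```python
import re
import unicodedata

# Assumption: allowed characters are ASCII letters, digits, hyphen, underscore
# and space, followed by exactly one period and the file type.
SAFE_NAME = re.compile(r"[A-Za-z0-9_\- ]+\.[A-Za-z0-9]+")


def is_safe_filename(name: str) -> bool:
    """Return True if the whole name uses only the recommended characters."""
    return SAFE_NAME.fullmatch(name) is not None


def suggest_filename(name: str) -> str:
    """Suggest a plain-ASCII name. Review the result by hand: it loses detail."""
    stem, dot, ext = name.rpartition(".")
    if not dot:  # no period at all, so there is no file type to keep
        stem, ext = name, ""
    # Split accented letters into base letter + combining marks, drop the marks.
    decomposed = unicodedata.normalize("NFKD", stem)
    no_marks = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # Replace every remaining character outside the allowed set with "_".
    cleaned = re.sub(r"[^A-Za-z0-9_\- ]", "_", no_marks)
    return f"{cleaned}.{ext}" if ext else cleaned
```

For example, `suggest_filename("1.ha'a~martha&bill.wav")` returns `1_ha_a_martha_bill.wav`. The suggestion is deliberately crude, so your team may prefer its own conventions (such as the `7` for a glottal stop in the table above).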
3. Create your CSV spreadsheet
The batch spreadsheet captures the metadata and documentation about your files and is required for the batch upload. You'll need a spreadsheet application to proceed with this step.
The FirstVoices team recommends Open Office because it handles the Indigenous special characters best. You can download Open Office for free at https://openoffice.apache.org/.
Review the specific preparation instructions for the kind of batch you are working on:
From the “Resources” section of the instructions page, download a template spreadsheet and save it on your computer.
Open the template spreadsheet on your computer. If you are using Open Office, ensure that ONLY the box next to "Comma" is checked before clicking OK.
Under “Separator options”, only “Comma” should be checked.
Add, remove, or hide columns in the spreadsheet to match your anticipated workflow and the information you plan to upload.
If there are extra columns in the template that you do not plan to use, you can delete or “hide” them. For example, you can delete columns relating to image files if you are not using them.
You can add additional columns to the template if you wish. Your additional columns will not be uploaded unless the header matches a column listed in the preparation instructions.
Use caution when changing column headers. Columns that do not exactly match the expected batch format will be skipped and the data may not upload.
Enter your data into the appropriate columns below the headers.
Take great care to avoid typos. Any errors you make in the content of this spreadsheet will be uploaded to your FirstVoices language site or may affect the success of the upload.
For example, the Categories you enter in the spreadsheet must exactly match a Category on your FirstVoices site.
When you are finished and ready to save, save your spreadsheet in CSV UTF-8 (Comma delimited) (.csv) file format.
Give the file a practical name with no spaces or special characters, following the rules in step 2.
Example: denekeh_september_2019_words.csv
Do NOT save as an Excel file (.xlsx) or a regular Comma Separated Values (.csv) – these cannot be processed and may break special characters in your data.
More information on saving your spreadsheet with correct Indigenous special characters can be found here: Save spreadsheets in UTF-8 CSV format
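If your team generates the spreadsheet from a script instead of a spreadsheet application, the same "comma-delimited, UTF-8" requirement applies. Here is a minimal Python sketch using the standard `csv` module. The column headers `WORD` and `DEFINITION` are placeholders for this example only; use the headers from the template in the preparation instructions. Whether the uploader expects a byte-order mark at the start of the file is not stated here, so this sketch writes plain UTF-8 without one.

```python
import csv

# Placeholder headers for illustration only; copy the real ones from the template.
HEADERS = ["WORD", "DEFINITION"]


def write_batch_csv(path, rows):
    """Write rows (a list of dicts keyed by HEADERS) as a comma-delimited UTF-8 file."""
    with open(path, "w", encoding="utf-8", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=HEADERS)
        writer.writeheader()
        writer.writerows(rows)


def read_batch_csv(path):
    """Read the file back so special characters can be checked after saving.

    "utf-8-sig" accepts UTF-8 files both with and without a byte-order mark.
    """
    with open(path, encoding="utf-8-sig", newline="") as f:
        return list(csv.DictReader(f))
```

Reading the file back and comparing it with what you wrote is a quick way to confirm that special characters survived the save.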
4. Prepare media files
Note: MP3 files are smaller and are the preferred choice for uploading to FirstVoices and for downloading. WAV files can be used for master copies.
To include media files in the upload, copy all the included audio, image, and video files together into one folder on your computer.
Give this folder a practical name related to your batch. The name should follow the rules in step 2 (e.g. NO SPACES.)
Example folder: denekeh_september_2019_words_audio
Ensure the names of all media files follow the rules in step 2 (e.g. the only special character should be the period before the file type, such as .mp3 or .wav).
Example filename: 2018_harry_sally_word1.mp3
Ensure the exact filenames of the media files are entered into your spreadsheet where they belong, along with information such as the title.
Zip the folder to compress it for easier submission.
How to zip your media files on a Mac
Locate the folder to zip
Right-click on the folder
Click "Compress"
Find the newly created .zip archive in the same location on your computer
How to zip your media files on a PC
Locate the folder to zip.
Right-click on the folder
Click "Send to" and then click "Compressed (zipped) folder"
Find the newly created .zip archive in the same location on your computer
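If you prefer to zip the folder from a script, the steps above can be sketched in Python with the standard `shutil` module. The function name `zip_media_folder` is made up for this example.

```python
import shutil
from pathlib import Path


def zip_media_folder(folder):
    """Create <folder>.zip next to the folder and return the archive's path."""
    folder = Path(folder).resolve()
    archive = shutil.make_archive(
        base_name=str(folder),   # ".zip" is appended automatically
        format="zip",
        root_dir=folder.parent,  # paths inside the archive are relative to this
        base_dir=folder.name,    # keep the folder itself inside the archive
    )
    return Path(archive)
```

Calling `zip_media_folder("denekeh_september_2019_words_audio")` should leave a `denekeh_september_2019_words_audio.zip` file in the same location as the folder, just like the right-click method.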
5. Double-check all your files and the spreadsheet
It's important to make sure your files, metadata, and documentation are correct before you submit them. Take particular care that the filenames in your spreadsheet exactly match the names of the files in your folder. Any typo in a filename, category, part of speech, speaker, or link may prevent that row from uploading successfully.
Work with your language team to double-check and proofread all entries before sending them in. You may wish to send the spreadsheet to another team member, who can open it and make sure it saved correctly.
Once the batch upload has been submitted, any errors from typos will need to be fixed individually online by your team.
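Part of this double-check can be automated. The Python sketch below compares the file names listed in one spreadsheet column with the files actually present in your media folder. It is only an illustration: the column name `AUDIO_FILENAME` is an assumption, so replace it with the header used in your template, and the comparison is exact (including upper and lower case).

```python
import csv
from pathlib import Path


def check_media_references(csv_path, media_dir, column="AUDIO_FILENAME"):
    """Compare file names listed in one CSV column with the files in media_dir.

    Returns (missing, unused): names listed in the spreadsheet but not found in
    the folder, and files in the folder that no row mentions.
    """
    with open(csv_path, encoding="utf-8-sig", newline="") as f:
        listed = {
            (row.get(column) or "").strip()
            for row in csv.DictReader(f)
        }
    listed.discard("")  # rows without a media file are fine
    present = {p.name for p in Path(media_dir).iterdir() if p.is_file()}
    return sorted(listed - present), sorted(present - listed)
```

Both lists should be empty before you submit. A script like this only catches mismatched file names; typos in the words, translations, or categories still need a human proofreader.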
6. Send to the FirstVoices team for processing
Format your spreadsheet and media folder as such:
| | Format | Example |
|---|---|---|
| Spreadsheet | sitename_month_day_year_[keyword].csv | ktunaxa_august_2_2021_words.csv |
| Zipped media | sitename_month_day_year_[keyword]_media.zip | ktunaxa_august_2_2021_words_media.zip |
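To avoid typos in these names, you could build them from their parts. This is a small Python sketch of the format in the table above; the function name `batch_file_names` is made up for this example, and it assumes English month names in lower case, as in the example column.

```python
from datetime import date

MONTHS = [
    "january", "february", "march", "april", "may", "june",
    "july", "august", "september", "october", "november", "december",
]


def batch_file_names(sitename, when, keyword):
    """Return (spreadsheet_name, media_zip_name) in the format shown above."""
    month = MONTHS[when.month - 1]
    stem = f"{sitename}_{month}_{when.day}_{when.year}_{keyword}"
    return f"{stem}.csv", f"{stem}_media.zip"
```

For example, `batch_file_names("ktunaxa", date(2021, 8, 2), "words")` returns `ktunaxa_august_2_2021_words.csv` and `ktunaxa_august_2_2021_words_media.zip`.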
If your upload is large and cannot be attached by email, you can upload these files to FPCC’s custom Sharepoint in a folder designated for your team. To set this up or get access to the link, email hello@firstvoices.com.
Email batch@fpcc.ca with your files attached or, if you uploaded them to Sharepoint, let us know once the upload is complete. The subject of the email should match the name of your spreadsheet. For example, if the spreadsheet you uploaded was ktunaxa_august_2_2021_words.csv, that should be your email subject.
Please note that it may take us longer than usual to process your batch uploads, as our FirstVoices development team capacity has been significantly reduced. Our team is working diligently to get batches processed in a timely manner, and your batch will be responded to and completed as soon as possible.
To reduce the time it takes us to process batches for all teams, we would appreciate it if you sent no more than one batch per month, rather than smaller, more frequent batches.
7. Review uploaded content
Once FirstVoices staff has uploaded your content, we will notify you by email.
Then, you should review the new content to make sure it uploaded correctly. For words/phrases, the entries will be visible only to your language team by default until you approve them.
Congrats, you have uploaded a batch!
Workflow Tips
Always keep your original media files in a safe, secure place. Servers and password-protected external hard drives are always good places to keep a back-up.
It's important to have a workflow that clearly distinguishes new entries from ones that have already been uploaded. One easy way to do this is to move files to a folder marked DONE or UPLOADED after uploading.
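If you manage your files with scripts, moving finished files can be sketched in Python like this. The function name `mark_as_uploaded` and the default folder name are only examples. Move working copies only, and keep your original media files in their safe back-up location.

```python
import shutil
from pathlib import Path


def mark_as_uploaded(file_path, done_dir="UPLOADED"):
    """Move a file into a folder named done_dir beside it; return the new path."""
    src = Path(file_path)
    target_dir = src.parent / done_dir
    target_dir.mkdir(exist_ok=True)
    target = target_dir / src.name
    if target.exists():
        # Refuse to overwrite a file that was already moved earlier.
        raise FileExistsError(f"{target} already exists")
    shutil.move(str(src), str(target))
    return target
```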