Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add model card, license, and alphabet for xty #14

Merged
merged 1 commit into from
Apr 20, 2022

Conversation

JEMeyer
Copy link
Contributor

@JEMeyer JEMeyer commented Apr 18, 2022

Overview

Added new model card/alphabet file/license for the Yoloxóchitl Mixtec language (xty).

Dataset was modified from https://www.openslr.org/89/ by removing chunks of records where sox would crash when trying to figure out the number of audio channels. The data was then process by commonvoice utils (https://github.com/ftyers/commonvoice-utils) to convert into the mono audio to train with Coqui.

Model adding separately: STT-SLR89-XTY-0.1

@JRMeyer JRMeyer merged commit c40ac6e into coqui-ai:main Apr 20, 2022
@JEMeyer JEMeyer deleted the jemeyer/xtyModelCard branch April 20, 2022 19:28
@serapio
Copy link

serapio commented Apr 30, 2022

Did you find a way to filter out those chunks, or was it a matter of letting sox fail and then manually remove them? Do you have a script for (most of) this process?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants