Are you a master of words with a knack for crafting engaging scripts in Gujarati? We’re searching for a creative genius to develop dynamic content, ranging from dialogues to narrations, while building an extensive pronunciation dictionary. If you’re ready to blend creativity with precision and bring scripts to life, this is the perfect role for you!
Key Responsibilities:
Phoneme Identification and Corpus Development:
Analyze large Gujarati datasets to comprehensively identify phonemes
Construct a balanced Gujarati corpus from diverse sources, considering:
Phonetic diversity
Statistical word usage frequency
Varied utterance and sentence lengths
Phonetic Rule Establishment:
Develop and document clear grapheme-to-phoneme conversion rules for Gujarati using the International Phonetic Alphabet (IPA)
Lexicon Creation:
Generate and maintain a Gujarati lexicon with accurate phonemic representations
Continuous Improvement:
Regularly review and update the Gujarati phoneme set to ensure comprehensive coverage
Conduct n-gram analysis of Gujarati phonemes to evaluate coverage and identify gaps
Script Development:
Collaborate in creating approximately 33 hours of balanced Gujarati script content
Ensure scripts incorporate identified phonemes for optimal coverage
Quality Assurance:
Develop and document comprehensive quality procedures for all aspects of the linguistic work
Implement and maintain strict quality control measures throughout the project lifecycle
TELUS International AI Community
Our global AI Community is a vibrant network of 1 million+ contributors from diverse backgrounds who help our customers collect, enhance, train, translate, and localize content to build better AI models. Become part of our growing community and make an impact supporting the machine learning models of some of the world’s largest brands.
Qualification path
If you meet the basic requirements outlined below, you are welcome to apply for this task, and our team will reach out to you promptly!
Requirements:
Native Gujarati speaker with full proficiency in reading, writing, and speaking
Fluency in English
18 years of age or older
Advanced degree in Linguistics or Computational Linguistics
Extensive knowledge of Gujarati phonetics and phonology
Proficiency in using the International Phonetic Alphabet (IPA)
Experience in corpus linguistics and natural language processing, preferably with Gujarati language data
Strong analytical skills, particularly in statistical language analysis
Familiarity with text-to-speech systems and their linguistic foundations
Excellent attention to detail and ability to manage large datasets
Strong communication skills in both Gujarati and English
Ability to work in a collaborative environment
Proven experience in developing and implementing quality procedures and documentation in linguistic projects
Commitment to maintaining the integrity of the linguistic work without relying on AI-generated content