Are you a data engineer that loves languages and language data? Come join the Applied Modeling and Data Science team in Cambridge. As a Language Data Engineer, you will be responsible for sourcing data in multiple languages. You will create platforms to source, tokenize, and query large datasets. You will implement creative and scalable solutions that turn noisy language data into readable formats, without compromise to the integrity of the language. You will set standards for formats and uncover data attributes that will enhance the ontologies that power spoken language understanding systems.
1+ years of data engineering or software development experience.
Experience with Python, SQL, and databases.
Passion and proven experience working with text data in multiple languages, either academic or professional.
Knowledge of one or more non-English languages.
Strong problem solving and troubleshooting skills.
BS/MS in Computer Science, Computational Linguistics, or a related field.
Experience with the full lifecycle of designing, developing, and maintaining software.
Experience with AWS, relational databases, and NoSQL databases.
Experience managing large data sets.
Experience turning noisy language data into consumable entities and formats.
Amazon.com is an Equal Opportunity-Affirmative Action Employer – Minority / Female / Disability / Veteran / Gender Identity / Sexual Orientation