Amazon EMR (http://aws.amazon.com/emr) is an AWS service that makes it easy for customers to run their big data workloads. EMR supports well-known big data platforms like Hadoop and Spark, and multiple applications that are part of this ecosystem, like Hive, Presto, Pig and Hue.
The EMR Release team is looking for senior, experienced, passionate and talented engineers to innovate in the rapidly growing area of big data and integrate the latest in big data technology applications into a massively scaled distributed service.
Come join the team that tackles the complexities of build, integration and maintenance while providing a reliable, quality stream of releases for today and emerging applications (Spark, Tez, Presto, Flink, etc.) in the big data space.
You: · Are passionate about distributed processing and massive scaled applications · Enjoy using and participating in development of open source applications · Enjoy working with the latest software technologies and paradigms · Are eager to be an early adopter of emerging big data applications · Savor the challenges of making diverse technologies work through the entire stack · Are comfortable with continuous integration workflows and tools like git, jenkins and gradle · Are comfortable with Java and digging into any programming language in order to understand the internals of an application · Are at ease with Linux troubleshooting and bash scripting and can do wonders with a single command · Are comfortable with OS-specific packaging such as rpm/dpkg
· MS Computer Science or other relevant technical degree and/or related experience. · Experience with Java or Scala build systems: maven, ant, sbt and gradle. · Experience with multiple Apache Hadoop / Spark ecosystem applications, like: Hadoop, Hive, Oozie, Pig, Presto, Hue, Spark, Tachyon, Zeppelin · Java development experience. · Big data application development experience. · Recognized subject matter expertise in Hadoop or Spark and at least one other complimentary application. · Experience building and operating at scale · Commits or contribution via code or technical guidance to Apache Hadoop, Spark or related big data projects. · Significant contribution to development for applications or services with a large user base · Strong understanding of distributed systems and distributed computation. · Deployment automation experience with scripting, chef, puppet, etc. · Linux and RPM packaging experience. · Ability to communicate comfortably, at different levels, with different stakeholders · Excellent communication and collaboration skills
· Experience with Apache Hadoop ecosystem applications: Hadoop, Hive, Oozie, Presto, Hue, Spark, Zeppelin and more! · Strong Java development experience. · Experience building and operating at scale on AWS · Commits or contribution via code or technical guidance to Apache Hadoop, Spark or related big data projects. · Significant contribution to development for applications or services with a large user base