Amazon

Returning Candidate?

Manager, Network Operations

Manager, Network Operations

Job ID 
529854
Location 
US-WA-Seattle
Posted Date 
9/20/2017

Job Description

Amazon’s network is a key differentiator for Amazon Web Services (AWS), enabling the global operation of thousands of applications across hundreds of thousands of servers worldwide. The AWS Networking team develops and operates the network platform for all of Amazon, including e-commerce products and cloud computing solutions. This platform is industry-leading for its efficiency, throughput and reliability, and it is critical to the success of hundreds of thousands of AWS customers. Are you ready to own the network availability for the largest cloud network on the planet? With Amazon Web Services (http://aws.amazon.com), our goal is to become “The Infrastructure Platform” to the world. Our customers demand the highest quality and reliability for their services. As we expand at a tremendous rate across all of our services, it is our responsibility to maintain that quality and reliability. We look for innovative ways to automate the operation of our network and drive complex issues to resolution.

As part of this Network Operations Manager role, a successful candidate will be responsible for managing one of our global teams aligned with one of our network fabrics. Direct reports will include Network Development Engineers located in multiple regions who are in turn responsible for teams of highly qualified network engineers who own responding to large-scale operational events, escalations from our tier-1 fault-handling systems, and automation of all network fault-handling. The Network Operations team is expected to identify and mitigate any network problem that could lead to customer impact as quickly as possible. The team is also responsible for deep-dive and root-cause analysis of events which impact our customers ensuring that we improve our processes, alarming, and automation in order to avoid customer impact in the future. AWS leaders are highly-technical and understand their environment to accurately represent operational issues and exercise high judgment during high severity events. As the manager of one of our Network Operations teams, successful candidates will be responsible for setting and delivering on the strategic vision for the team.

Our engineers, managers and leaders are innovators at heart; come join us and become integral to the technology company that is the past, present and future of real Cloud Computing.


Responsibilities:


Operational Excellence

As a manager within the Networking team you will be expected to drive operational excellence in everything we do. This includes creating sane processes and procedures to improve efficiency in our day-to-day tasks and projects. You will drive standards across the network and ensure that we are fully compliant to those standards and policies. You will work closely on supporting our internal customers and ensuring that their needs and issues are being addressed.

Network Measurement
As a Network Operations manager you will be expected to drive quality into the metrics we report to assist us in focusing on the areas that give us the best ROI. This includes measurement of our issues, network capacity, vendor equipment/failures analysis and network performance.

Performance Management/Team Health
You will own all facets of performance and career management for the team. Regular one-on-one meetings with all team members are required. You will be expected to provide both technical and ‘soft skill’ mentoring in order to maintain a well-rounded, world-class organization. This includes project management, quality audits and coordination of training sessions with senior-level engineers as well as day-to-day oversight of the team including scheduling of a 8x7x365 operational rota.

Incident/Change Management
You will be integral to developing and improving incident and change management within the Networking space. Responsibilities include driving initiatives regarding improvements to existing tools & processes and providing feedback on new practices & procedures in order to scale with the rapid expansion of the Amazon platform and customer base.

Recruiting and Hiring
You will take the lead in hiring quality personnel who not only fit the needs of the current organization but also will allow the team to scale with platform and service growth. You will coordinate with Amazon and external recruiting staff to evaluate potential candidates, participate in initial phone screens and provide relevant guidance and feedback during on-site interview loops. You will also be responsible for ensuring that proper training takes place for all new hires.


Automation

You will be heavily involved in driving the team to analyze operational events in order to identify new automation opportunities and help us achieve our vision of all tier-1 faults in the network being fully remediated by software. This will include helping our software teams in Network Operations understand our requirements and drive their roadmaps to ensure that our network engineers are able to directly create new automation tasks directly within those software frameworks rather than relying on other teams to deliver them.


Goal Setting and Delivery
You will directly own defining and delivering strategic goals for your team and for the wider NetOps team. As part of this responsibility, you will be expected to communicate goals to the team, align the necessary resources, and communicate status to the wider leadership team.


Oncall
As a member of the Networking management team, you will be expected to participate in an escalation oncall rotation for all Networking issues during the daytime hours in your location, including high-impact network events. The primary role of this rotation is to ensure that high-severity events have all necessary resources applied to solve the issue as quickly as possible.

This is an amazing opportunity in terms of responsibility, interesting challenges and high visibility. We truly are looking for the highest quality candidates, so you should expect a rigorous interview process.

Basic Qualifications


Must have a high degree of organization and be very detail-oriented. Must be able to interact with and influence people at all levels and have excellent public speaking skills. Must have the ability to contribute to and support long-term visions and direction regarding Networking at Amazon. Experience in building and managing a team of strong technical people, and prior ownership of the operation of a mission-critical team is crucial to success. Experience managing multiple teams or across multiple geographies is also desirable. The successful candidate will have a proven track record of success in driving operational excellence, including coordinating and driving issues to resolution autonomously utilizing excellent judgment skills.

A B.S. in Computer Science or four years of equivalent experience in a large-scale enterprise environment is required. Experience in Network Engineering is highly recommended.

Preferred Qualifications

Key Skills
  • Strong leadership skills
  • Strong interpersonal skills
  • Strong mentoring skills
  • Strong presentational skills
  • Ability to cope under pressure
  • Ability to understand and drive service management/SLAs
  • Ability to coordinate multiple activities and work in a high pace environment