Amazon.com is looking for a highly motivated Network Operations Manager to help support one of the world’s largest and most complex networks. Amazon Retail is a world leader in the online retail space and demands a best-in-class network team to ensure its reliability and availability and to meet the continuous growth in scale year after year. With Amazon Web Services (http://aws.amazon.com), our goal is to become “The Infrastructure Platform” to the world. Our customers demand the highest quality and reliability for their services. As we expand at a tremendous rate across all of our services, it is our responsibility to maintain that quality and reliability. We look for innovative ways to automate and scale our network as we expand, while driving complex issues to resolution.
As a Network Reliability manager you will be responsible for managing and developing a team of 7-10 Network Support Engineers and/or Network Engineers. This team provides support for the Amazon worldwide network during local daytime hours, operating as one of three global support centres on a "Follow the Sun" basis. As manager of that team you will be responsible for driving key operational improvements, metrics and change implementation, and for informing our automation teams’ roadmap to reduce the operational load on the team.
Responsibilities:
As a manager within the Networking team you will be expected to drive operational excellence in everything we do. This includes creating sane processes and procedures to improve efficiency in our day-to-day tasks and projects. You will drive standards across the network and ensure that we are fully compliant with those standards and policies. You will work closely with our internal customers, supporting them and ensuring that their needs and issues are being addressed.
Network Measurement
As a Network Operations manager you will be expected to drive quality into the metrics we report, helping us focus on the areas that give us the best ROI. This includes measurement of our issues, network capacity, vendor equipment/failure analysis and network performance, as well as continual assessment of the quality and effectiveness of our network monitoring and alarming.
Technical Leadership
As a manager of a highly technical team, which has responsibility for operational availability of the Amazon global network, you will be expected to have a deep knowledge of your area. As part of your role, you will be required to review and approve network changes for your team. Additionally, you will on occasion need to develop a detailed, low-level understanding of the network issues that occur and be able to represent those issues at operational management review meetings.
Performance Management/Team Health
You will own all facets of performance and career management for the team. Regular one-on-one meetings with all team members are required. You will be expected to provide both technical and ‘soft skill’ mentoring in order to maintain a well-rounded, world-class organization. This includes project management, quality audits and coordination of training sessions with senior-level engineers, as well as day-to-day oversight of the team, including scheduling of a 7x8x365 operational rota.
Incident/Change Management
You will be integral to developing and improving incident and change management within the Networking space. Responsibilities include driving initiatives regarding improvements to existing tools & processes and providing feedback on new practices & procedures in order to scale with the rapid expansion of the Amazon platform and customer base.
Recruiting and Hiring
You will take the lead in hiring quality personnel who not only fit the needs of the current organization but will also allow the team to scale with platform and service growth. You will coordinate with Amazon and external recruiting staff to evaluate potential candidates, participate in initial phone screens and provide relevant guidance and feedback during on-site interview loops. You will also be responsible for ensuring that proper training takes place for all new hires.
Automation
You will be heavily involved in setting our sister automation teams' roadmap, analyzing significant opportunities where automation can tackle volume, systemic or critical operational issues.
On-call
As a member of the Networking management team, you will be expected to participate in an on-call rotation for management-level escalations of Networking issues, including high-impact network events. This includes some weekends, but only during daytime hours.
This is an amazing opportunity in terms of responsibility, interesting challenges and high visibility. We truly are looking for the highest quality candidates, so you should expect a rigorous interview process.