Query SELECT `session_id` FROM `user_session_data_storage` WHERE `session_id` = ? : 
Statement could not be executed (HY000 - 145 - Table './db05/user_session_data_storage' is marked as crashed and should be repaired)
Query INSERT INTO `user_session_data_storage` SET `last_activity` = NOW(), `session_id` = ? : 
Statement could not be executed (HY000 - 145 - Table './db05/user_session_data_storage' is marked as crashed and should be repaired)
Senior Service Reliability Engineer. Job in Los Gatos, California, United States in Netflix Inc.. Nelest.com
Home  |  Contact  |  About Us
|   Register  |  Sign In

Senior Service Reliability Engineer

col-narrow-left   

Job ID:

22840

Location:

Los Gatos, CA 

Job function:

Engineering, Information Systems Management
col-narrow-right   

Posted:

03.09.2016

Employment Type:

Full time

Industry:

Entertainment / Leisure / Recreation
col-wide   

Title:

Senior Service Reliability Engineer

Job Description:

Senior Service Reliability Engineer

Los Gatos, California

They click Play and it just works...for over 70 million customers. Behind the scenes tens of thousands of globally distributed cloud instances are processing petabytes of data and billions of network requests, providing uninterrupted service alongside frequent code deployments and huge autoscaling changes. Making such a web service highly available requires that constituent microservices maintain excellent reliability. Come join the Performance and Reliability team who helps keep everything humming along. With increased growth we are looking for game-changing individuals who can take our distributed service reliability strategy to the next level. Help us drive that initiative and make future customers as thrilled as the existing ones.

Responsibilities include:

  • Develop effective tooling, alerts, and response to both identify and address reliability risks
  • Participate in on-call rotation with other teams in the Performance and Reliability Teams
  • Engage with product engineering teams to triage production outages and carry forward action items to improve ongoing reliability
  • Define and evangelize cloud-related optimizations and best practices to improve reliability and performance
Minimum Job Qualifications:

  • Ability to root cause sources of instability in a high-traffic, large-scale distributed system
  • Experience with configuration and troubleshooting of Linux, Java, Tomcat, and other middleware technologies
  • Understands large-scale complex systems from a reliability perspective
  • Scripting abilities in python, perl, or JVM-based languages
  • Passion for resolving reliability issues and identify strategies to mitigate going forward
Winning Qualities:

  • Experience with Cloud Computing platforms (particularly AWS) a plus
  • Deep network analysis experience a plus
  • Strong Linux system-level analysis capabilities

Job Requirements:

  • Ability to root cause sources of instability in a high-traffic, large-scale distributed system
  • Experience with configuration and troubleshooting of Linux, Java, Tomcat, and other middleware technologies
  • Understands large-scale complex systems from a reliability perspective
  • Scripting abilities in python, perl, or JVM-based languages
  • Passion for resolving reliability issues and identify strategies to mitigate going forward
Winning Qualities

  • Experience with Cloud Computing platforms (particularly AWS) a plus
  • Deep network analysis experience a plus
  • Strong Linux system-level analysis capabilities

Zip Code:

95030

Company Info
Netflix Inc.

Los Gatos, CA, United States

Phone:
Web Site: www.netflix.com

Company Profile

Company Info


Netflix Inc.
Los Gatos, CA, United States
Phone:
Web Site: www.netflix.com

Senior Service Reliability Engineer

col-narrow-left   

Job ID:

22840

Location:

Los Gatos, CA 

Job function:

Engineering, Information Systems Management
col-narrow-right   

Posted:

03.09.2016

Employment Type:

Full time

Industry:

Entertainment / Leisure / Recreation
col-wide   

Title:

Senior Service Reliability Engineer

Job Description:

Senior Service Reliability Engineer

Los Gatos, California

They click Play and it just works...for over 70 million customers. Behind the scenes tens of thousands of globally distributed cloud instances are processing petabytes of data and billions of network requests, providing uninterrupted service alongside frequent code deployments and huge autoscaling changes. Making such a web service highly available requires that constituent microservices maintain excellent reliability. Come join the Performance and Reliability team who helps keep everything humming along. With increased growth we are looking for game-changing individuals who can take our distributed service reliability strategy to the next level. Help us drive that initiative and make future customers as thrilled as the existing ones.

Responsibilities include:

  • Develop effective tooling, alerts, and response to both identify and address reliability risks
  • Participate in on-call rotation with other teams in the Performance and Reliability Teams
  • Engage with product engineering teams to triage production outages and carry forward action items to improve ongoing reliability
  • Define and evangelize cloud-related optimizations and best practices to improve reliability and performance
Minimum Job Qualifications:

  • Ability to root cause sources of instability in a high-traffic, large-scale distributed system
  • Experience with configuration and troubleshooting of Linux, Java, Tomcat, and other middleware technologies
  • Understands large-scale complex systems from a reliability perspective
  • Scripting abilities in python, perl, or JVM-based languages
  • Passion for resolving reliability issues and identify strategies to mitigate going forward
Winning Qualities:

  • Experience with Cloud Computing platforms (particularly AWS) a plus
  • Deep network analysis experience a plus
  • Strong Linux system-level analysis capabilities

Job Requirements:

  • Ability to root cause sources of instability in a high-traffic, large-scale distributed system
  • Experience with configuration and troubleshooting of Linux, Java, Tomcat, and other middleware technologies
  • Understands large-scale complex systems from a reliability perspective
  • Scripting abilities in python, perl, or JVM-based languages
  • Passion for resolving reliability issues and identify strategies to mitigate going forward
Winning Qualities

  • Experience with Cloud Computing platforms (particularly AWS) a plus
  • Deep network analysis experience a plus
  • Strong Linux system-level analysis capabilities

Zip Code:

95030
Copyright © 2016 NELEST.COM All rights reserved