Job responsibilities
- Troubleshoot major incidents, facilitate blameless post-mortems, and ensure non-recurrence of issues on the ServiceNow platform or other critical applications.
- Design, code, test, and deliver software to automate manual operational tasks, enhancing the availability and stability of ServiceNow and its critical integration services.
- Analyze database and system performance issues, providing solutions to improve service availability.
- Collaborate with software engineers and teams to design, develop, test, and implement solutions for availability, reliability, and scalability.
- Implement infrastructure, configuration, and network as code for applications and platforms.
- Understand service level indicators and utilize service level objectives to proactively resolve issues before they impact customers.
- Support the adoption of site reliability engineering best practices within your team
Required qualifications, capabilities, and skills
- Formal training or certification onSite Reliabilityconcepts and 3+ years applied experience
- Extensive experience in supporting the ServiceNow Platform or other enterprise products and services, with strong debugging and troubleshooting skills for complex systems.
- Proficient in site reliability culture and principles, with hands-on experience implementing them within applications or platforms to enhance reliability and performance.
- Strong background in working with relational databases such as MySQL and Oracle, including expertise in database tuning for optimal performance.
- Skilled in diagnosing application performance degradation and implementing effective solutions to improve system efficiency.
- Proficient in scripting languages including JavaScript, Python, Perl, Unix Shell, and Windows Shell, with experience in writing or debugging Object-Oriented code, particularly in Java.
- Experienced in working with dynamic HTML components such as AJAX, JavaScript, AngularJS, CSS, XML, HTML, and XHTML to create interactive and responsive web applications.
- Proficient in web services technologies, including SOAP and REST, as well as data extraction technologies like JDBC and ODBC, for seamless data integration and communication.
- Knowledgeable in TCP/IP networking, IT Service Management (ITSM), IT Infrastructure Library (ITIL), and Configuration Management Database (CMDB), with experience in observability to monitor and improve system performance.
Preferred qualifications, capabilities, and skills
- Ability to solve unique problems in areas such as compute services, including Containers & Serverless, and other AWS Services.
- Strong understanding of AWS Network Architecture and general networking.
- ServiceNow Certification; CSA or higher is desired.