Expoint - all jobs in one place

מציאת משרת הייטק בחברות הטובות ביותר מעולם לא הייתה קלה יותר

Limitless High-tech career opportunities - Expoint

Citi Group AppDynamics - Splunk Expert L3 Support Observability SRE 
Mexico, Mexico City 
848248841

Yesterday

Responsibilities:

  • Create complex project plans, perform impact analyses, solve/work high impact problems/projects, and provide resolution to restore services
  • Provide Root Cause Analysis (RCA) post restoration of service
  • Design testing approaches, complex processes, reporting streams, and assist with the automation of repetitive tasks
  • Provide technical/strategic direction to team members
  • Review requirement documents, define hardware requirements and update processes and procedures as necessary
  • Ensure ongoing compliance with regulatory requirements
  • Responsible for applications dealing with the overall operating system
  • Conduct project related research
  • Has the ability to operate with a limited level of direct supervision.
  • Can exercise independence of judgement and autonomy.
  • Acts as SME to senior stakeholders and /or other team members.
  • Appropriately assess risk when business decisions are made, demonstrating particular consideration for the firm's reputation and safeguarding Citigroup, its clients and assets, by driving compliance with applicable laws, rules and regulations, adhering to Policy, applying sound ethical judgment regarding personal behavior, conduct and business practices, and escalating, managing and reporting control issues with transparency.


Qualifications:

  • 6-10 years of experience in roles centered around infrastructure delivery (application hosting and/or end user services) with a proven track record of operational process change and improvement
  • Understanding of how specialization within area contributes to the business and of competitors' products and services
  • Ability to develop projects required for design of metrics, analytical tools, benchmarking activities and best practices
  • Ability to work with virtual / in-person teams, and work under pressure / to a deadline
  • Experience in a Financial Services or large complex and/or global environment preferred
  • Effective written and verbal communication skills
  • Effective analytic/diagnostic skills
  • Ability to communicate technical concepts to non-technical audience


Education:

  • Bachelor’s/University degree, Master’s degree preferred


This job description provides a high-level review of the types of work performed. Other job-related duties may be assigned as required.

Main responsibilities

Detect an analyze the significance of events to our operations, software development life cycles, applications security, and end-user experience.
Implement continuous automation; automatic discovery, instrumentation, and baselining of every system component on a continuous basis shifts IT effort away from manual configuration.
Comprehensive approach to IT service management, including wide perception of changes, developments, upgrades, infrastructure, and application configurations.
Build and manage monitoring tools, develop data pipelines, sometimes investigate incidents, and most certainly troubleshoot from time to time.
Develop, implement and integrate policies, procedures, tools and resources to support the observability framework.
Gather new requirements from IT teams and market best practices to enhance the framework.
Work with product owners to ensure all technical products and services follow observability standards and policies.

Technical skills

Bachelor’s degree in computer science or equivalent combination of education and experience.
In-depth knowledge of DSLs (domain-specific languages) such as Apache Lucene, Elasticsearch/ELK Query, Splunk Search Language (SPL), Google Cloud Monitoring query Language (MQL), Prometheus PromQL, Kibana Query Language (KQL), etc.
Knowledge of configuration management, monitoring dashboards, metrics, alerts, thresholds, reports, etc.
Prior experience in working with log aggregation, metrics, distributed tracing, synthetics monitoring and real user monitoring. Proven experience working as a L3 support.
Experience working with incidents support such as ITIL or other ITSM frameworks.
Excellent troubleshooting skills.
Experience with automation using PowerShell, Python, Shell scripting or similar tech preferred.
Knowledge of data visualization.
Collect, aggregate and visualize collected metrics to provide actionable insights.
Diagnose and troubleshoot technical issues.
Intermediate English level is a must; written and spoken, be able to establish technical conversations. Advanced level is a plus.
Applications Support


Time Type:

Full time

View the " " poster. View the .

View the .

View the