Design, develop, maintain, and operate distributed software systems that incorporate real-time and streaming data for the monitoring, aggregation, and control of millions of IoT devices
Characterize complex problems related to the scalability, reliability, performance, and security of production systems
Test the performance, scalability, and reliability of software systems at scale, including developing the services to support this testing
Provide technical leadership, foster collaboration, and drive initiatives to completion
Maintain the values of the team which include engineering excellence, curiosity, a bias for action, self-awareness, inclusivity, and vulnerability
What You’ll Bring
5+ years of experience designing, building, and maintaining large-scale production systems
Experience with cloud platforms such as AWS, Azure, or Google Cloud
Proficiency with Kafka and Kubernetes
Strong systems programming skills, with fluency in Akka or Scala preferred
Expertise in building distributed, fault-tolerant services
Deep understanding of low-level operating system concepts, including multi-threading, memory management, networking, storage, performance, and scalability
Strong foundation in computer science, including data structures, algorithms, and distributed systems
Proven ability to develop and implement creative solutions