Business Description:
Best Ring POS is an industry leader in tablet based point of sale systems used at festivals, events and venues across the United States. We develop and maintain a streamlined system that handles everything from menu customization to payment processing regardless of environment complexity. Our clients choose to work with us because of our quality of people and service and value we provide to both the client and the end user. We are looking for a Site Reliability Engineer to build out the foundation of our SRE team. You would have the opportunity to learn and develop your own vision of Site Reliability Engineering as we scale and improve our system.
Responsibilities Include:
- Efficiently handle live production incidents, debug/troubleshoot application and infrastructure issues, follow and implement SRE best practices.
- Weekend system monitoring is required during festival season.
- Monitor application performance, take steps to improve overall application performance and stability, and follow through with implementation.
- Build end-to-end monitoring infrastructure (Logging, Metrics, Tracing) and work closely with the other Production Engineers to provide the right tooling to measure the reliability of our systems.
- Collaborate with development and operations teams to ensure availability and reliability of the application and infrastructure.
- Work closely with software engineers and QAs to ensure the system is responding properly to non-functional requirements such as performance, security, and availability.
Required Experience and Skills
- 5+ years of experience in software engineering, SRE or performance engineering role.
- 5+ years of experience in cloud architecture and administration, specifically with AWS
- Experience supporting Java Spring Boot APIs and Web apps
- Strong knowledge of networking and security in a cloud environment
- Hand-on expertise with monitoring and logging tools such as CloudTrail and CloudWatch
- Good working knowledge of Linux/Unix (AWS Linux) operating systems and administration
- You will need to be proficient in at least one coding language such as Java, Javascript or Python
- Experience working with CI/CD tools, including Jenkins and GitHub.
- Excellent communication and teamwork abilities
- AWS certifications are a plus
Note: This job is hybrid remote, although we are very flexible you must live in the Texas area and be able to work on sites at least once a month.
Principals Only
Job Type: Full-time
Pay: $90,000.00 - $100,000.00 per year
Benefits:
- Dental insurance
- Flexible schedule
- Health insurance
- Professional development assistance
- Vision insurance
Compensation package:
Experience level:
Schedule:
- Monday to Friday
- On call
- Weekends as needed
Application Question(s):
- Do you live in or near the Austin, TX Area?
Experience:
- System administration: 5 years (Required)
- AWS: 5 years (Required)
- J2EE: 2 years (Required)
Work Location: In person