Description
Site reliability engineers will be dedicated full-time to creating software tools, metrics and processes that improve the reliability of applications, sites, and systems in production. The Site Reliability Engineer is primarily responsible for ensuring the integrity, functionality, and reliability of applications and sites.
RESPONSIBILITIES
Develop software to detect unusual error activity. Implement workflows and processes that are designed to identify and reduce the overall number of application/system errors.
Collaborate with software development as part of the SDLC to design and implement availability, reliability, and error monitoring solutions in their applications.
Take responsibility for removing, isolating, or remediating errors, debugs, warnings or other kinds of messages from existing logs to improve overall log content and usefulness.
Limit system downtime by defining and enforcing standards for incident responses, error tracking, monitoring, and alerting with the goal to improve established reliability metrics.
Effectively respond to escalated site reliability issues any time of the day while on-call.
Conduct regular research on best practices and new technology for monitoring, alerting, error tracking and detection and application performance
Education/Certification:
Bachelor's degree in Computer Science, MIS or related field
Experience:
3+ years' experience utilizing alerting and telemetry tools such as Grafana, Prometheus, Splunk, Dynatrace and others
2+ years' experience with Splunk SPL
2+ years' experience with at least one programming language such as PHP, Python, Java, .Net
PREFERRED QUALIFICATIONS
Experience:
1+ years' experience with CI/CD
1+ years' experience with container and container orchestration such as Docker and Kubernetes
1+ years' experience with Prom
1+ years' experience with SQL
Skills/Abilities:
Troubleshooting in a large-scale networked environment
Knowledge of Paycom's applications, systems, and database
Paycom is an equal opportunity employer and prohibits discrimination and harassment of any kind. Paycom makes employment decisions on the basis of business needs, job requirements, individual qualifications and merit. Paycom wants to have the best available people in every job. Therefore, Paycom does not permit its employees to harass, discriminate or retaliate against other employees or applicants because of race, color, religion, sex, sexual orientation, gender identity, pregnancy, national origin, military and veteran status, age, physical or mental disability, genetic characteristic, reproductive health decisions, family or parental status or any other consideration made unlawful by applicable laws. Equal employment opportunity will be extended to all persons in all aspects of the employer-employee relationship. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, training, promotion, discipline, compensation benefits, and separation of employment. The Human Resources Department has overall responsibility for this policy and maintains reporting and monitoring procedures. Any questions or concerns should be referred to the Human Resources Department. ****To learn more about Paycom's affirmative action policy, equal employment opportunity, or to request an accommodation - Click on the link to find more information: paycom.com/careers/eeoc
#LI-Hybrid
RESPONSIBILITIES
Develop software to detect unusual error activity. Implement workflows and processes that are designed to identify and reduce the overall number of application/system errors.
Collaborate with software development as part of the SDLC to design and implement availability, reliability, and error monitoring solutions in their applications.
Take responsibility for removing, isolating, or remediating errors, debugs, warnings or other kinds of messages from existing logs to improve overall log content and usefulness.
Limit system downtime by defining and enforcing standards for incident responses, error tracking, monitoring, and alerting with the goal to improve established reliability metrics.
Effectively respond to escalated site reliability issues any time of the day while on-call.
Conduct regular research on best practices and new technology for monitoring, alerting, error tracking and detection and application performance
Education/Certification:
Bachelor's degree in Computer Science, MIS or related field
Experience:
3+ years' experience utilizing alerting and telemetry tools such as Grafana, Prometheus, Splunk, Dynatrace and others
2+ years' experience with Splunk SPL
2+ years' experience with at least one programming language such as PHP, Python, Java, .Net
PREFERRED QUALIFICATIONS
Experience:
1+ years' experience with CI/CD
1+ years' experience with container and container orchestration such as Docker and Kubernetes
1+ years' experience with Prom
1+ years' experience with SQL
Skills/Abilities:
Troubleshooting in a large-scale networked environment
Knowledge of Paycom's applications, systems, and database
Paycom is an equal opportunity employer and prohibits discrimination and harassment of any kind. Paycom makes employment decisions on the basis of business needs, job requirements, individual qualifications and merit. Paycom wants to have the best available people in every job. Therefore, Paycom does not permit its employees to harass, discriminate or retaliate against other employees or applicants because of race, color, religion, sex, sexual orientation, gender identity, pregnancy, national origin, military and veteran status, age, physical or mental disability, genetic characteristic, reproductive health decisions, family or parental status or any other consideration made unlawful by applicable laws. Equal employment opportunity will be extended to all persons in all aspects of the employer-employee relationship. This policy applies to all terms and conditions of employment, including, but not limited to, hiring, training, promotion, discipline, compensation benefits, and separation of employment. The Human Resources Department has overall responsibility for this policy and maintains reporting and monitoring procedures. Any questions or concerns should be referred to the Human Resources Department. ****To learn more about Paycom's affirmative action policy, equal employment opportunity, or to request an accommodation - Click on the link to find more information: paycom.com/careers/eeoc
#LI-Hybrid
Jobs from our Partners
Senior Solution Architect - Data and Analytics
Hartford, CT
US
Senior System Integration and Test Engineer
San Diego, CA
US
Critical Infrastructure Engineer
Plano, TX
US
Critical Infrastructure Engineer
Irvine, CA
US
IDT Software Engineer - Huntsville
Huntsville, AL
US
Dynamics 365 and C# Developer
Remote
US
Similar Jobs
Senior SIEM Security Engineer
Remote
Overland Park, KS
Senior Container Engineer
Remote
Overland Park, KS
Data Scientist, Risk & Fraud
Phoenix, AZ
Seattle, WA
Senior Research Engineer, Speaker Identification
Mountain View, CA
There are more than 50,000 engineering jobs:
Subscribe to membership and unlock all jobs
Engineering Jobs
50,000+ jobs from 4,500+ well-funded companies
Updated Daily
New jobs are added every day as companies post them
Refined Search
Use filters like skill, location, etc to narrow results
Become a member
🥳🥳🥳 264 happy customers and counting...
Overall, over 80% of customers chose to renew their subscriptions after the initial sign-up.
Cancel anytime / Money-back guarantee