Aqx598 - Senior Site Reliability Engineer - Spain Remote | (A-909)
Responder al anuncioSenior Site Reliability Engineer - Spain RemoteHi, thanks for reading about our Senior Site Reliability Engineer opportunity! We're glad you're here. We're Knack, a code-free platform used by thousands of customers — from non-profits to the world's biggest companies — to easily build custom apps, workflows, and databases. We're looking for someone to help improve our reliability and performance through deep analysis and remediation of our AWS infrastructure, monitors, alerts, and code. Please note: this is a remote role based in Spain. Key ResponsibilitiesPerform deep analysis of logs, existing systems and codebases to find opportunities to improve performance and reliability, driving execution of suggested changes. Refactor our existing monitors and alerts to be actionable and reliable, recommending and implementing diagnostic techniques and monitoring tools. Help discover correlations between customer experience and performance indicators to determine what is noticeable by customers: suggest and implement improvements based on findings. Help us to develop SLI's, SLO's, and SLA's that are impactful as they relate to our customer's experience. Help triage outages and issues across multiple teams, services, and codebases as they arise, leading root cause analysis and creating sustainable solutions to prevent and/or auto-remediate those issues in the future. Work with our QA teams to help implement automated performance and scalability testing within our CI/CD pipelines. Assist in creating reusable pipeline code, working with cloud, dev, and qa teams to help reduce complexity and deployment times. Introduce chaos engineering, promoting experimentation in production to discover and remediate systemic weaknesses and improve performance and reliability. Skills Knowledge and ExpertiseExpertise in AWS. Expertise with RDS, preferably Aurora PostgreSQL engine. Expertise with containerization. Expertise in monitoring, alerting, and logging solutions and in how to use them to enable the organization to achieve reliability and performance goals. Experience implementing, maintaining, and troubleshooting continuous integration/continuous delivery (CI/CD) tooling. Experience with implementing improvements in areas such as maintainability, scalability, availability, extensibility and security. Ability to work with many teams across disciplines (cloud, platform, development, qa, and security) to resolve issues as they arise and implement improvements. Our StackOur stack is evolving over the next year and we'd love you to be a part of that! Currently we're using:Back-end: JavaScript/TypeScript, Node. Js, ES6, GoLangData: Aurora PostgreSQL, Redis, ElasticSearchDevOps Deployment: All things AWS, Terraform (and Terraform Cloud), Jenkins, Github, Grafana, GrayLogTesting: Playwright, Mocha, JestFront-end: Vue#J-18808-Ljbffr
¡Sea el primero en responder a este anuncio de trabajo!
-
¿Por qué está buscando trabajo en Trabajas.es?
Crear alerta de empleo
Cada día nuevos anuncios de trabajo Puede elegir entre una amplia gama de trabajos: nuestro objetivo es ofrecer una selección lo más amplia posible Déjenos enviar nuevos anuncios por correo electrónico Sea el primero en responder a las nuevas ofertas de empleo Todos los anuncios de trabajos en un único lugar (de empleadores, agencias y otros portales) Todos los servicios para demandantes de empleo son gratuitos Le ayudaremos a encontrar un nuevo empleo