Back to jobs
Database Reliability Engineer
Taiwan
We are looking for
- A self-driven Database Reliability Engineer with proven experience in managing large-scale database systems in production environments.
- Have a deep understanding of database architecture, performance optimization, high availability solutions, and database security best practices.
- Are following the latest industry trends in database technologies and are passionate about building reliable, scalable data infrastructure.
- Have strong automation mindset and programming skills to build tools that improve database operations and reliability.
- Understand SLI/SLO/SLA concepts and have experience implementing reliability engineering practices.
Key Responsibilities
- Work in a team of DBRE and DevOps professionals
- Design, implement, and maintain highly available database architectures to support business growth across multiple countries
- Improve existing database infrastructure, including performance tuning, capacity planning, backup/recovery strategies, and disaster recovery procedures
- Develop automation tools and internal platforms using Go/Python to streamline database operations, reduce manual interventions, and minimize human errors
- Monitor database health and performance metrics, establish alerting mechanisms based on SLO burn rates, and proactively identify potential issues
- Take ownership and responsibility for database operations including schema changes, data migrations, and production deployments
- Collaborate with development teams to provide database design consultation, query optimization, and troubleshooting support
- Conduct regular database security audits and implement security best practices to protect sensitive data
Our Stack
- Relational Databases: AWS Aurora MySQL/PostgreSQL
- NoSQL Databases: MongoDB Atlas, AWS DocumentDB
- Caching Layer: AWS ElastiCache, Valkey
- Message Queue: Apache RocketMQ, RabbitMQ
- Search Engine: ElasticSearch, Mongo Atlas
- OLAP: Redshift
- Monitoring & Alerting: Prometheus, Grafana, Loki, Alert Manager
- Backup & Recovery: AWS Backup, Point-in-Time Recovery, Cross-Region Replication
- Database Proxy: RDS Proxy
- Infrastructure as Code: Terraform, Ansible
- CI/CD Integration: Jenkins, ArgoCD, Github Action, Helm
- Network & Security: AWS VPC, Security Groups, AWS Secrets Manager
- Programming Languages: Golang, Python
- Containerization: Kubernetes
Apply for this job
*
indicates a required field
