DevOps Engineer – Monitoring & Dashboarding Specialist
Who are we?
CFI Financial Group is an award-winning trading provider, possessing more than 25 years of experience with multiple offices around the world including London, Larnaca, Beirut, Amman, Dubai, Kuwait, Port Louis, and others.
Check out more about CFI here.
CFI is hiring! Make your mark in the online trading industry.
Are you looking to pursue a career in finance? Do you want to work with a dynamic and growing team in the exciting world of online trading and investing? If you answered yes, then we have some amazing opportunities for you!
Description:
We are seeking a highly skilled DevOps Engineer with expertise in infrastructure monitoring, observability, and dashboard development. The ideal candidate will be responsible for designing, implementing, and maintaining robust monitoring systems that ensure the reliability, performance, and scalability of our infrastructure and applications.
Key Responsibilities
- Monitoring & Observability
- Design, deploy, and manage monitoring solutions using tools such as Prometheus, Grafana, Datadog, ELK, Zabbix, or New Relic.
- Develop custom dashboards and visualization panels to track system health, application performance, and service-level indicators (SLIs).
- Implement alerting strategies to proactively detect and address system anomalies.
- Work closely with Dev, QA, and Ops teams to ensure all critical services are properly instrumented and observable.
- Automation & Infrastructure
- Automate monitoring and alerting configuration through Infrastructure-as-Code (IaC) tools (e.g., Terraform).
- Integrate monitoring pipelines into CI/CD workflows to ensure continuous visibility during deployments.
- Maintain and optimize monitoring infrastructure for performance and cost efficiency.
- Incident Response
- Support incident management processes by providing accurate telemetry and actionable insights.
- Develop and document runbooks and playbooks to standardize issue resolution.
- Participate in on-call rotations and post-incident reviews to improve observability coverage.
- Collaboration & Continuous Improvement
- Work with developers to define metrics, logs, and traces for new services.
- Continuously assess and improve monitoring frameworks to align with evolving business and technical needs.
- Provide training and documentation for internal teams on using dashboards and interpreting metrics.
Qualifications
- Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent experience).
- 3+ years of experience in a DevOps, SRE, or infrastructure engineering role.
- Proven experience with monitoring and visualization tools such as Grafana, Prometheus, Loki, ELK Stack, Datadog, or CloudWatch.
- Strong understanding of Linux systems, networking fundamentals, and cloud environments (AWS).
- Familiarity with containerization and orchestration technologies like Docker and Kubernetes.
- Knowledge of logging, tracing, and metrics best practices within observability stacks.
Why join CFI?
- We’re a fast-growing, multinational company
- Competitive salaries and benefits
- Work and learn with industry professions
- Supportive and collaborative environment
- Unlimited opportunities for growth and development
- Department
- Information Technology
- Locations
- Malaysia- Selangor
- Remote status
- Hybrid
- Employment type
- Full-time