Experience
Staff Reliability Engineer — Stockholm, Sweden (Remote role)
As a Staff Engineer at Chainlink Labs, I take ownership of ensuring our success, whatever the challenge or task. I work closely with some of the brightest minds in the industry, bridging theoretical design and practical implementation to meet operational and reliability demands. I take an active part in designing and redesigning our services, as well as in hands-on implementation (coding).
Notable projects owned and delivered:
- CLL Low-Latency Oracle DDoS Resilience and Edge Authentication
- Telemetry refactor for reduced cardinality
- Various reliability and alerting-improvement projects
Notable achievements:
I have been part of the team that prototyped, built, launched, and scaled the Chainlink Low-Latency Oracle solution from 0 to 200+ critical streams. This infrastructure now supports some of the most prominent players in the industry and secures over $1B in value.
Sr. Site Reliability Engineer — Stockholm, Sweden (Remote role)
As a Senior Site Reliability Engineer (SRE) at Chainlink Labs, my role was to ensure the success of Chainlink Labs products with a focus on reliability. My contributions included identifying gaps, providing guidance, and taking part in implementing proposed solutions.
My engagement usually started with an evaluation of a product's reliability and maturity, covering CI/CD setup, documentation, health checks, monitoring, alerting, metrics, logging, QA, scalability, and risk management.
Recent projects:
- End-to-end ownership and delivery of a multi-active version of our low-latency oracle solution
- Lead Reliability Architect for a new product (hybrid Web2/Web3 service)
- Creator and maintainer of an RPC Blockchain Exporter
- Company-wide SRE initiative — Service Maturity & SLOs
Technologies/tools encountered daily:
Terragrunt / Terraform, Argo CD, GitHub, GitHub Actions, Python, Go, AWS, Confluent Cloud, Kafka, Postgres, Kubernetes, Helmfile, Helm.
Site Reliability Engineering Manager — Amsterdam, Netherlands
Served as Site Reliability Engineering Manager.
SRE Team Lead — Amsterdam, Netherlands
As an SRE Team Lead, my responsibilities ranged from mentoring my team as a senior engineer and day-to-day management (Scrum) to hiring, conducting performance evaluations, and tracking personal goals for team members. As part of the leadership organisation, together with directors and the product-management team, I helped align team goals with company strategy, both short and long term.
Team responsibilities:
- Running and improving highly resilient and secure SaaS & PaaS platforms hosting Bloomreach Experience Manager solutions
- Providing consultancy and support during onboarding of larger clients
- Responding to platform incidents
- Building tools and automation to support fully automated roll-outs to production
- Providing architectural designs and non-functional requirements for new products and solutions
- Evangelising SRE culture within other operations teams
Projects worth mentioning:
- Architectural design for hosting the SaaS version of our PaaS solution
- Transition to the SRE paradigm
Tooling, technologies & concepts encountered daily:
Kubernetes, Helm, Docker, Ansible, CloudFormation, Python, Go, NGINX Ingress Controller, Tomcat, Java, Cloudflare, CDN, WAF, GitLab CI, Jenkins, Serverless, AWS Lambda, AWS SAM, AWS, Agile, Scrum.
Senior Cloud Operations Engineer — Amsterdam, Netherlands
As a Senior Cloud Operations Engineer, I was hired to build, operate, and improve the Bloomreach Experience Manager PaaS offering. Together with a team of six developers and engineers, my task was to bring an operational mindset to mission-critical workloads on our production systems.
Notable projects:
- Production CD-automation solution for all production clusters
- Monitoring & metrics rebuild
- Infrastructure as code
- Platform resource planning & optimisation for larger clients
- Centralised logging solution
Tooling, technologies & concepts encountered daily:
Kubernetes, Helm, Docker, Ansible, CloudFormation, Python, Go, NGINX Ingress Controller, Tomcat, Java, Cloudflare, CDN, WAF, GitLab CI, Jenkins, AWS, Prometheus, Thanos, API Gateways, Elasticsearch, Agile, Scrum.
Mission Critical Engineer — Amsterdam, Netherlands
Projects:
- Architecting and designing new highly available IT landscapes for mission-critical services (reliability, security, efficiency, automation, monitoring & metrics)
- Owning and setting CI/CD processes and automation for rolling out and developing new services, taking audit and security requirements into account
- Creating migration plans and migrating legacy services, ensuring best practices were followed
- Establishing "way-we-work" processes based on Agile & Scrum methodologies
Tooling, technologies & concepts encountered daily:
Kubernetes, Helm, Docker, CloudStack, Chef, Ansible, Terraform, Python, GitLab CI, Jenkins, AWS, GCP, PostgreSQL, MySQL.
(DevOps) System Engineer — Amsterdam, Netherlands
Served as a system engineer in a CI/CD team, working on processes, solutions, and tooling to increase quality and shorten time-to-market cycles for different product offerings across multiple platforms.
Project highlights for 2017:
- Unified SDLC implementation for back-end services
- Lead engineer on Backbase Pivotal Cloud Foundry setup on AWS
- Automation of deployment/recovery of CI/CD tools
- Lead engineer on re-deployment, migration, and upgrade of multiple CI/CD production services
- Declarative-configuration support for Backbase 6 CXP deployment solution based on Ansible
Tooling, technologies & concepts encountered daily:
Jenkins, GoCD, Maven, Gradle, Jira, Confluence, Bitbucket, Docker, Vagrant, Packer, AWS, Pivotal, Ansible, Bash, Python, CloudFormation, InfluxDB, TICK, Grafana, Agile, Scrum.
System Administrator — Amsterdam, Netherlands
Project highlights for 2016:
- Physical migration from LeaseWeb Schiphol to EvoSwitch Haarlem data centre
- Installation and configuration of a new VMware cluster at the HQ data centre in Amsterdam
- Migration from local to high-performance SAN storage
- Backup and disaster-recovery implementation for multiple data centres
- Network re-configuration per best practices, improving performance and security
- Migration and upgrade of the main ISP line, planning bandwidth allocation
- Deployment and setup of new branch offices
- Centralisation of authentication and authorisation for all internal services
- Network upgrade with Cisco ISR routers, improving VPN performance and efficiency
- Numerous minor improvements resulting in better efficiency, performance, and uptime
- Maintenance of Cisco enterprise network gear (ASA appliances, ISR routers, WLCs, Catalyst switches)
- Served as AWS administrator, ensuring connectivity to development environments
ICT Support Engineer — Amsterdam, Netherlands
Hired as a support engineer and grew into a SysAdmin role by proactively taking ownership of various ICT projects, predominantly around VMware-based in-house data centres and end-user support.
IT Lead — Dubai, UAE
Managed a team of IT professionals, ensuring flawless IT operations across multiple office locations, end-user support, and on-site IT-equipment maintenance in a Windows-based environment.
IT Administrator — Dubai, UAE
Managed a Microsoft-based IT landscape across two office locations and served as Salesforce administrator.