Resume
Summary
Experienced Site Reliability Engineer with a strong background in software engineering and networking, and over 10 years of experience supporting multinational IT organizations. Passionate about building container and cloud-based solutions and applying expertise to data-oriented infrastructure challenges.
Experience
Senior DevOps Engineer
Sep 2023 — PresentCaruso Dataplace · Remote
- Led the migration to a modern, scalable infrastructure using Terraform, Terragrunt, and ArgoCD, establishing repeatable and auditable deployment workflows
- Used Terraform and Terragrunt to provision both AWS cloud resources (EKS, MSK, RDS Aurora) and production-ready platform tooling (SonarQube, SigNoz, ArgoCD) as fully codified, repeatable deployments
- Worked closely with software developers and architects to implement and optimize microservices architecture
- Collaborated with the security team to conduct internal penetration testing and proactively mitigate vulnerabilities
- Developed and maintained Kubernetes Operators in Go to streamline and automate Day-2 operations across the platform
DevOps Engineer
Mar 2023 — Sep 2023Cocus AG · Berlin, Remote
- Contributed to the design and implementation of the Hydra Project, participating in architectural discussions and delivering infrastructure components
- Implemented cloud infrastructure using CloudFormation, enabling scalable and fully automated deployments
- Contributed to the lifecycle management of backend services powered by Go
- Managed and responded to Prisma Cloud security alerts, ensuring timely investigation and resolution
- Designed and implemented AWS resource architectures to support evolving project requirements
System Development Engineer
Nov 2022 — Feb 2023Amazon · Germany, Remote
- Contributed to the design and implementation of the LTT product, taking part in technical planning and delivery discussions
- Developed end-to-end solutions for LTT project tasks, from initial concept through to production deployment
- Tested and iterated on code before and after production releases to ensure system reliability and stability
- Built working relationships with peers, new teammates, and colleagues across the business to drive project success
DevOps Engineer
Jul 2021 — Aug 2023Cocus AG · Remote
- Enhanced data filtering and processing of real-time streaming workloads using the NiFi and Kafka platform
- Used Ansible to provision packages, build Amazon Machine Images, and deploy non-destructive changes using dynamic inventories. Leveraged Terraform to provision and maintain infrastructure as code
- Supervised the deployment of core application stack software within the DevOps toolchain, improving overall system reliability
- Improved key performance indicators for incident and change management, leading to measurable gains in customer satisfaction
System Engineer
Feb 2019 — May 2021Unbelievable Machine GmbH (*um) · Berlin
- Leveraged the Ambari platform to automate Hadoop and Kafka cluster deployments and monitor their operational status
- Used Ansible and Puppet frameworks to automate cloud deployments through custom scripts and workflows
- Supervised and deployed core application stack software for load balancer systems, ensuring high availability and reliability
- Improved KPIs for incident and change management, contributing to an increase in customer satisfaction scores
System Administrator
Apr 2016 — Oct 2016PersianGig · Tehran
- Used the OpenStack platform to automate the deployment of virtual private servers and monitor their operational status
- Leveraged Puppet frameworks to automate cloud deployments through custom scripts and workflows
- Supervised and deployed core application stack software for database and web server systems
- Provided technical support to resolve customer issues and served as their primary point of contact for billing and general inquiries
- Improved incident management KPIs, contributing to higher customer satisfaction scores
Network Engineer
Jan 2015 — Mar 2016Homa Telecom · Tehran
- Collaborated with a cross-functional team of engineers while maintaining close engagement with leadership and business stakeholders to drive innovation and growth
- Configured and launched a comprehensive networking stack, including routers, load balancers, and firewalls
- Optimized wireless networking equipment to achieve maximum throughput in a highly congested radio frequency environment
- Designed and deployed a comprehensive network and hardware monitoring system using the Zabbix platform, capable of tracking and reporting a wide range of operational metrics
- Improved network connectivity stability and backbone redundancy, reducing downtime and increasing resilience
Skills
Languages
Frameworks
Tools
Platforms
Soft Skills
Education
Bachelor of Software Engineering
2005 – 2010Azad University of Shiraz · Shiraz, Iran
Courses: Operating Systems, Data Structures, Algorithms, Programming Models, Networking, Databases
Certifications
- Developing on AWS (AWS, 2022)
- Google Certified Associate Cloud Engineer (Google, 2021)
- Certified Kubernetes Administrator (CNCF, 2020)
- DevOps Tools Engineer LPI-701 (LPI, 2020)
Projects
Streamzilla
2021–2022Porsche AG
A one-stop platform for all data streaming needs within Porsche AG. An internally managed service enabling engineering product teams to build and run applications leveraging the low latency, high throughput, and fault tolerance of Apache Kafka and Apache NiFi. Designed with a cloud-agnostic approach to be highly scalable and deployable across cloud, hybrid cloud, and on-premise environments.
Legacy Database Migration
2022Fidor Bank
Migrated a legacy RDBMS cluster processing millions of daily transactions to a modern data warehouse. The new platform used an Infrastructure as Code approach with Ansible, supporting deployment across public and private cloud providers. Data consistency was ensured using Galera, while ProxySQL separated read and write nodes to prevent split-brain scenarios.
UMCP
2020–2021Unbelievable Machine
A managed private cloud platform for B2B customer services. Enabled engineering product teams to build and run applications leveraging the scalability and high availability of Red Hat OpenShift and Knative for microservices workloads. Fully automated using the Ansible framework to support configurable, large-scale management.
CNBDP
2019–2021BMW AG
A managed big data solution for data processing within BMW AG. Enabled engineering product teams to build and run applications for high-throughput data workloads. Hundreds of petabytes of data were stored on HDFS, and hundreds of gigabytes were ingested and streamed daily using Hadoop ecosystem tools including Spark, Hive, HBase, and Oozie. Adopted an Infrastructure as Code approach using Ansible and Ambari to ensure scalability across any infrastructure.
Honors & Awards
- 1st Runner-Up at OpenStack Hackathon, Berlin — 2019
- 3rd Runner-Up at DCI Hackathon, Berlin — 2018
Volunteer Experience
Brave Ambassador
Jan 2020 – PresentBrave · Global, Online
Organized events, conducted workshops, and delivered technical sessions reaching over 1,000 developers.
Tutor
Jan 2019 – PresentReDI School · Berlin, Germany
Delivered online and in-person technical and soft-skills training to over 200 students.