Federico Cipriani

Summary

Lifelong, passionate programmer with several years as a Software Engineer and around 5 years in DevOps/SRE and FinOps roles.

I am an experienced Linux user skilled in managing large-scale production systems with a strong emphasis on critical thinking for cost management, observability, and stress testing, including chaos engineering principles. Feel free to ask me anything!

Education

Electronic Technician

Otto Krause Buenos Aires, Argentina

Space Systems Engineering

Universidad Nacional de San Martín (UNSAM) Buenos Aires, Argentina

Languages

I am an EU citizen (Italian) with native proficiency in Spanish and C2 proficiency in English.

Experience

Site Reliability Engineer

Yuno Remote/Hybrid: Buenos Aires, Argentina & Bogotá, Colombia
06/2022 - 12/2024

Terraformed multiple AWS accounts, with peering for various specific purposes.

Provisioned and maintained several EKS clusters and managed their service mesh.

Adopted Datadog, providing observability & instrumentation while controlling data ingestion costs.

Proactively met weekly to analyze cloud spending and plan FinOps automation strategies.

Engaged with cloud provider teams to assess technical and cost-fit solutions for our needs.

Designed dashboards and monitors for SLOs, collaborating on SLI definitions with various teams.

Implemented near real-time Change Data Capture (CDC) pipelines for the Data team's BI needs.

Co-led Reliability Track sessions, using Testkube to stress test and boost core product performance.

Developed several FastAPI backends to integrate Argo, AWS, Datadog, Kubernetes, Opsgenie, and Slack.

Worked with the SecOps team on compliance audits to meet consultant requirements.

Site Reliability Engineer

Agot AI Remote/Hybrid: Buenos Aires, Argentina & Pittsburgh, US
03/2021 - 05/2022

Provisioning a hybrid on-premise cloud with K3S on Nvidia Jetson.

Review and refactoring of technology stacks in both FrontEnd and BackEnd areas.

Quality control and code sanitization of all projects through GitLab CI pipeline jobs (ShellCheck, Pytest, etc.).

Research and benchmarking of cloud-based Machine Learning training tools (SageMaker, MLFlow, and Kubeflow).

Budget and compatibility planning of inference box servers for on-premise instances.

Software Engineer

Democracia en Red Remote: Buenos Aires, Argentina
11/2020 - 04/2021

Development of full-stack solutions with Svelte and FastAPI technologies for a civic software platform.

Integration with Blockchain Federal Argentina (BFA) Proof of Authority (PoA) network for an e-voting system.

Provisioning of resources in both DigitalOcean and Amazon Web Services (AWS) public clouds.

Prototyping, implementation, and maintenance of standardized solutions.

Software Engineer

Nevrona Remote: Buenos Aires, Argentina
04/2019 - 12/2020

Served as Lead Developer and Architect for the FrontEnd & Middleware team.

Co-led development of a comprehensive work and social networking platform from inception.

Designed and built administrative interface (backoffice) for platform management.

Authored technical documentation and established FrontEnd and general development guidelines.

Conducted technical interviews and recruitment for both BackEnd and FrontEnd teams.

Systems Administrator

APLI On-site: Buenos Aires, Argentina
05/2017 - 12/2019

Remote and on-site support of server infrastructure.

Development of scripts and utilities for the shell in Bash, Perl, and Python.

Monitoring and alarm systems with Nagios, Icinga, Nagstamon, and Prometheus.

Built a Raspberry Pi-based monitor wall for a Network Operations Center (NOC).

Licenses & certifications

I currently hold the following certifications and course completions:

HashiCorp Certified

  • Terraform Associate (003).

Linux Foundation

Cloud-native & Automation

  • Certified Argo Project Associate (CAPA).
  • Certified GitOps Associate (CGOA).
  • Certified Kubernetes Administrator (CKA).
  • Certified Kubernetes Application Developer (CKAD).
  • Kubernetes and Cloud Native Associate (KCNA).
  • Kubernetes and Cloud Native Security Associate (KCSA).
  • Scaling Cloud Native Applications with KEDA (LFEL1014).

Security & Compliance

  • Ethics for Open Source Development (LFC104).
  • Developing Secure Software (LFD121).
  • Security Self-Assessments for Open Source Projects (LFEL1005).
  • Securing Projects with OpenSSF Scorecard (LFEL1006).
  • Automating Supply Chain Security - SBOMs and Signatures (LFEL1007).
  • XSS Exploits and Defenses (LFEL1010).

See the above-listed certifications and all of my other verified credentials on Credly and CertDirectory.

Skillset

Cloud & Infrastructure

Ansible, Argo, AWS Ecosystem (ACM, CE, CloudFormation, CloudWatch, CodeArtifact, EC2, ECR, ElastiCache, EKS, IAM, KMS, Lambda, RDS, Route53, S3, Secrets Manager, SQS, VPC), CI/CD, FinOps, Infrastructure as Code (IaC), Istio, GitOps, GitHub Actions, Helm, Kustomize, Kubernetes, K3S, KEDA, NixOS, Service Mesh, Terraform, Terragrunt, Testkube, Velero.

Monitoring & Reliability

Alertmanager, Business Continuity Plan (BCP), Chaos Mesh, Datadog, Fluentd, Grafana, Prometheus, Observability Instrumentation, Incident Response Management, Opsgenie, Post Mortem Writing, Chaos Engineering, RPO/RTO, Statuspage, Stress Testing.

Security & Compliance

AIDE, CIS, CVEs/CWEs, Encryption, GDPR, HIDS/NIDS, LUKS, NIST Compliance, MITRE ATT&CK, Nmap, OWASP, PCI-DSS, SAST, SBOMs, STRIDE, Security Self-Assessment, TCPDump/LibPCap, WAF, Wireshark, Zero Trust Architecture.

Data Engineering

Airbyte, CDC, DVC, DynamoDB, Elasticsearch, ELT/ETL, Kafka, Kubeflow, MLOps, MSK, MySQL, MWAA, PeerDB, PostgreSQL, SageMaker.

Programming & Frameworks

ACID, Axum, CLEAN, CQRS, Deno, Express/Polka, FastAPI, Haskell, Nix/Flakes, Node.js, Python, Rust, Shell, Svelte/Kit, Typer, TypeScript.

Work Culture & Practices

Agile, Architecture Decision Records (ADRs), Follow-The-Sun (GDSE), KPI/OKR Alignment, Async Communication, Meeting Minutes & Outcome-Based Agendas.