I am currently pursuing an MS in Computer Science at the University of Illinois Chicago, with a focus on advanced data engineering, distributed systems, and big data technologies. I hold a Bachelor’s degree in Computer Science from R.V. College of Engineering, Bangalore, and am an AWS Certified Data Engineer – Associate.
Previously, I worked as a Senior Software Engineer at Fivetran, where I specialized in building efficient ETL pipelines and high-performance database connectors. My contributions include optimizing the BigQuery data writer to significantly reduce customer costs and designing fast, reliable connectors for DynamoDB and MongoDB. I also led the development of the widely adopted Isolated Endpoint Sync (IES) framework and mentored interns in 2023.
Passionate about building scalable, reliable, and innovative data solutions, I enjoy solving complex engineering problems that help organizations make data-driven decisions. I'm actively seeking opportunities to contribute to impactful backend and data engineering teams.
Hybrid conversational API with AWS Bedrock and Ollama, gRPC Lambda triggers, and Dockerized deployment using Akka HTTP.
Benchmarked real-time streaming pipelines on AWS and GCP with Terraform, cost/latency monitoring, and carbon analysis.
Processed large-scale text using Hadoop MapReduce on AWS EMR to generate embeddings with a custom tokenizer.
Used Apache Spark on EMR to train a neural network for natural language generation using tokenized text input.
Full-stack tool for managing student-instructor help sessions with scheduling, feedback, and dashboards.
Boosted movie revenue forecasting by 15% by combining metadata with Reddit and YouTube sentiment analysis using NLP.
Experienced Backend Software Engineer with 4+ years of experience building scalable data solutions, optimizing database connectors, and leading Big Data projects. Proficient in distributed systems and cloud technologies, with a strong focus on delivering impactful results through advanced data engineering and analytics.
Java (Advanced), SQL (Advanced), Python, Scala, C++, Bash/Shell scripting
Data Warehouses & Databases: Snowflake, BigQuery, Amazon Redshift, DynamoDB, MySQL, MongoDB, SQL Server, PostgreSQL
Data Engineering: ETL Pipelines, Data Modeling, Schema Evolution, Connectors, Data Integration, Data Orchestration
Distributed Systems: Apache Spark, Hadoop MapReduce, Apache Flink, Apache Kafka, AWS Kinesis, gRPC
EC2, Lambda, S3, EBS, Step Functions, Glue, EMR, ECS, RDS, Athena, EventBridge, IAM, KMS, CloudWatch, CloudTrail
Cloud Platforms: GCP (Compute Engine, Cloud Storage, Pub/Sub), Azure (Azure VM, Azure Blob Storage), VPC Networking
Technologies & Tools: Data Lakes, Big Data Processing, Data Validation, Data Quality, Data Governance, Data Visualization
Git, GitHub, CI/CD, Docker, Kubernetes, Terraform, Tableau, dbt, RESTful API, Webhooks, Logging, Data Security, Linux
Issued by: Amazon Web Services (AWS) — July 2025
This certification validates expertise in designing, building, securing, and maintaining data analytics solutions on AWS using core services such as Glue, EMR, Kinesis, Redshift, Lambda, S3, CloudWatch, and Athena. It demonstrates advanced knowledge of data lakes, data warehousing, streaming ingestion, schema evolution, and orchestration workflows across hybrid and serverless environments.
Skills acquired include: ETL pipeline optimization, data quality monitoring, cost-effective storage strategies, performance tuning, and event-driven architecture using Step Functions & EventBridge.
Fivetran - Jan 2023 - Feb 2023 (1 month)
Led the training and development of 12 interns from various colleges by designing and running a month-long onboarding program. The program covered essential tools such as version control systems and IDEs, overviews of company processes and best practices, and hands-on workshops on the architecture of complex systems. I also provided personalized guidance on navigating the company’s codebase so that each intern could contribute effectively to ongoing projects. As a result, the interns transitioned smoothly and became productive members of the team more quickly.
Technical Interviewer, Fivetran - Sep 2021 - Aug 2024 (3 years)
Conducted technical interviews for Software Interns, Software Engineers (SE1 and SE2), and Test Engineers, assessing candidates' expertise in coding, data engineering, and software development. Collaborated closely with the hiring team to identify top talent, ensuring alignment with both technical standards and company culture, while contributing to the continuous improvement of the interview process.
Technical Student Coordinator, Indian Institute of Technology, Guwahati (IITG) - Jan 2019
Served as the Technical Student Coordinator for a five-day workshop on "Emerging Trends in Multi-core Processors & Network on Chip Architecture," organized by RV College of Engineering under the IEEE RVCE Student Chapter. The workshop covered the latest advancements in multi-core processors and NoC architecture, with a special emphasis on emerging technologies in the field. My role involved organizing the event, coordinating with speakers, and managing the technical sessions so that the workshop ran smoothly.
Chicago, IL
sunil.kuruba.sk@gmail.com