Name: Sunil Kuruba

Education: M.S. in Computer Science, University of Illinois Chicago

Experience: 4 Years

Skills

🤖 Java

🤖 Data Engineering

🤖 Distributed Cloud Engineering

🤖 ETL Data Pipelines

🤖 Database Systems

🤖 Data Warehouses

🤖 AWS, GCP, Azure

About Me

I am currently pursuing an M.S. in Computer Science at the University of Illinois Chicago, with a focus on advanced data engineering, distributed systems, and big data technologies. I hold a Bachelor’s degree in Computer Science from R.V. College of Engineering, Bangalore, and am an AWS Certified Data Engineer – Associate.

Previously, I worked as a Senior Software Engineer at Fivetran, where I specialized in building efficient ETL pipelines and high-performance database connectors. My contributions include optimizing the BigQuery data writer to significantly reduce customer costs and designing fast, reliable connectors for DynamoDB and MongoDB. I also led the development of the widely adopted Isolated Endpoint Sync (IES) framework and mentored interns in 2023.

Passionate about building scalable, reliable, and innovative data solutions, I enjoy solving complex engineering problems that help organizations make data-driven decisions. I'm actively seeking opportunities to contribute to impactful backend and data engineering teams.

Projects

LLM Conversation API – Bedrock & Ollama

Hybrid conversational API with AWS Bedrock and Ollama, gRPC Lambda triggers, and Dockerized deployment using Akka HTTP.

AWS vs GCP Real-Time Pipeline

Benchmarked real-time streaming pipelines on AWS and GCP with Terraform, cost/latency monitoring, and carbon analysis.

Distributed LLM Token Embedding

Processed large-scale text using Hadoop MapReduce on AWS EMR to generate embeddings with a custom tokenizer.

Spark-Based Sentence Generator

Used Apache Spark on EMR to train a neural network for natural language generation using tokenized text input.

Help Session Management System

Full-stack tool for managing student-instructor help sessions with scheduling, feedback, and dashboards.

Movie Revenue Prediction

Boosted movie revenue forecasting by 15% by combining metadata with Reddit and YouTube sentiment analysis using NLP.

Resume

Experienced Backend Software Engineer with 4+ years of experience building scalable data solutions, optimizing database connectors, and leading Big Data projects. Proficient in distributed systems and cloud technologies, with a strong focus on delivering impactful results through advanced data engineering and analytics.

Experience


Senior Software Engineer

Mar 2023 - Aug 2024

Fivetran - Bengaluru, India
  • Redesigned the BigQuery data writer to align with SQL-based standards, reducing maintenance effort by 90% and streamlining new feature integration.
  • Increased Warehouse Data Writer pipeline throughput by 30% via multi-threading and concurrent file appends.
  • Implemented JSON data type support in BigQuery to enable seamless data migration and flexible schema handling.
  • Reduced customer BigQuery billing by 90% using partitioning and clustering, an award-winning hackathon solution.

Software Engineer 2

Sep 2021 - Mar 2023

Fivetran - Bengaluru, India
  • Engineered a high-performance DynamoDB connector with a 15x increase in incremental sync speed.
  • Developed a 5x faster MongoDB incremental sync using Change Streams for real-time updates.
  • Introduced connector support for Azure CosmosDB (Mongo API), enabling broader platform compatibility.
  • Created the Data Preview feature with Isolated Endpoint Sync (IES), improving onboarding and reusability across connectors.

Software Engineer

Jun 2020 - Aug 2021

Fivetran - Bengaluru, India
  • Invented the Isolated Endpoint Sync (IES) framework, now adopted in 500+ connectors and integrated into critical systems.
  • Developed a public Shopify app with authentication, GraphQL extraction, and robust fault-tolerance.
  • Improved Stripe connector performance with multi-threading, connected accounts, and fault-resilient features.
  • Built a new ETL connector for ADP using REST APIs and detailed ERD documentation.

Software Engineering Intern

Jan 2020 - May 2020

Fivetran - Bengaluru, India
  • Benchmarked end-to-end ETL pipeline performance using the Snowflake warehouse.
  • Implemented webhooks for the Recharge connector, increasing sync speed by 10x.
  • Developed and tested various API-based data-pipeline connectors with RESTful interfaces.


Skills


Programming Languages

Java (Advanced), SQL (Advanced), Python, Scala, C++, Bash/Shell scripting

Data Warehouses & Databases

Snowflake, BigQuery, Amazon Redshift, DynamoDB, MySQL, MongoDB, SQL Server, PostgreSQL

Data Engineering

ETL Pipelines, Data Modeling, Schema Evolution, Connectors, Data Integration, Data Orchestration

Distributed Systems

Apache Spark, Hadoop MapReduce, Apache Flink, Apache Kafka, AWS Kinesis, gRPC

AWS Services

EC2, Lambda, S3, EBS, Step Functions, Glue, EMR, ECS, RDS, Athena, EventBridge, IAM, KMS, CloudWatch, CloudTrail

Cloud Platforms

GCP (Compute Engine, Cloud Storage, Pub/Sub), Azure (Azure VM, Azure Blob Storage), VPC Networking

Technologies & Tools

Data Lakes, Big Data Processing, Data Validation, Data Quality, Data Governance, Data Visualization

Git, GitHub, CI/CD, Docker, Kubernetes, Terraform, Tableau, dbt, RESTful API, Webhooks, Logging, Data Security, Linux

Certifications

AWS Certified Data Engineer – Associate

Issued by: Amazon Web Services (AWS), July 2025

This certification validates expertise in designing, building, securing, and maintaining data analytics solutions on AWS using core services such as Glue, EMR, Kinesis, Redshift, Lambda, S3, CloudWatch, and Athena. It also demonstrates advanced knowledge of data lakes, data warehousing, streaming ingestion, schema evolution, and orchestration workflows across hybrid and serverless environments.

Skills acquired include: ETL pipeline optimization, data quality monitoring, cost-effective storage strategies, performance tuning, and event-driven architecture using Step Functions & EventBridge.

Leadership Experience

Onboarding Software Engineering Interns - Fivetran

Fivetran - Jan 2023 - Feb 2023 (1 month)

Led the training and development of 12 interns from various colleges, designing and implementing a comprehensive month-long onboarding program. This program included detailed sessions on essential tools such as version control systems and IDEs, in-depth overviews of company processes and best practices, and hands-on workshops focused on understanding the architecture of complex systems. Additionally, I provided personalized guidance on navigating the company’s codebase, ensuring that each intern was well-equipped to contribute effectively to ongoing projects. My efforts resulted in a smooth transition for the interns, enabling them to become productive members of the team more quickly.

Technical Interviewer

Fivetran - Sep 2021 - Aug 2024 (3 years)

Conducted technical interviews for Software Interns, Software Engineers (SE1 and SE2), and Test Engineers, assessing candidates' expertise in coding, data engineering, and software development. Collaborated closely with the hiring team to identify top talent, ensuring alignment with both technical standards and company culture, while contributing to the continuous improvement of the interview process.

Technical Student Coordinator

Indian Institute of Technology, Guwahati (IITG) - Jan 2019

Served as the Technical Student Coordinator for a five-day workshop on "Emerging Trends in Multi-core Processors & Network on Chip Architecture," organized by RV College of Engineering under the IEEE RVCE Student Chapter. This workshop focused on the latest advancements in multi-core processors and NoC architecture, with a special emphasis on emerging technologies in the field. My role involved organizing the event, coordinating with speakers, and managing technical sessions to ensure a smooth execution of the workshop.

Contact Me

Address

Chicago, IL

Email Address

sunil.kuruba.sk@gmail.com
