Toronto, ON

ML Performance Engineer at AWS optimizing large language model inference on custom ML accelerators. 6+ years of experience spanning ML systems, performance optimization, and distributed systems. Proficient in ML frameworks, profiling, collective communication, distributed inference, kernel-level optimization, and hardware-level systems analysis.


Experience

ML Performance Engineer

January 2025 – Present
Amazon Web Services

Software Engineer

August 2021 – December 2024
Amazon

Data Engineer

February 2019 – August 2021
Royal Bank of Canada

Research

Attentional Guidance in Visual Search

2017
Centre for Vision Research, York University, supervised by Prof. James Elder

Lassonde Undergraduate Research Award

2016, 2017
York University

Technologies

ML & Systems: PyTorch, vLLM, NKI (Neuron Kernel Interface), AWS Neuron SDK, TensorFlow

Performance & Hardware: AWS Trainium / Inferentia, kernel optimization, roofline analysis, collective communication, distributed inference, profiling, Perfetto

Languages: Python, Scala, Java, SQL, TypeScript

Data & Infrastructure: Apache Spark, Airflow, AWS CDK, Pandas, NumPy, Elasticsearch, EMR, EC2, S3, SQS, CloudWatch, DynamoDB, Aurora

Education

York University

B.Sc., Honours in Computer Science