Hayson Cheung

Robotics · Machine Learning · Embedded AI Systems
Research Interests
SAMUEL architecture diagram

Efficient Diffusion Models as Low-Cost Synthesizers

Vocal-conditioned music generation · IEEE/WI IAT · arXiv

World model robotics inference

World Models as a Proxy for Robotics

Predictive representations for embodied agents

Robotic arm picking objects

Robotic Manipulation that Combines DL and Classical Controls

World models + visual foresight for control

Pivo project visual

DL Microfluidics Simulation as a CFD surrogate

Efficient Simulation of Flows where advection and diffusion both matter · arXiv

Industrial data visual

Industrial Predictive Modeling as a Data-Centric Challenge

SkySense — Predicting from Industrial Data at Scale

NEAT agent in Smash Bros

Exploring Legacy Neuroevolution Methods for Understanding Trained Behavior

Neuroevolution in Smash Bros · Post

Ensemble methods for robotics

Ensembling NN as a Heuristic for Robust CV

Robust inference for robotic perception

Classical NEAT Implementation for Atari Games

Neuroevolution for Atari games

Software Demos
Pendulum demo

Pendulum

OpenCV tracking + dynamics modeling

LegoFIKS demo

LegoFIKS

3D generation + instruction synthesis · Post

Candidly project screenshot

Candidly

Anti-cheat platform for interviews · Devpost

Projects & Papers
SAMUEL: Soft Alignment for Music Enhancement Paper

A lightweight latent diffusion model for vocal-conditioned musical accompaniment generation. We present soft alignment attention that adaptively combines local and global temporal dependencies across diffusion timesteps. The model achieves 220× parameter reduction and 52× faster inference (15M params) while maintaining competitive quality—enabling real-time deployment on consumer hardware.

arXiv:2507.19991 · Latent Diffusion Attention Mechanisms Efficient Inference
SkySense: Industrial-Scale ML Systems Production

End-to-end predictive maintenance pipelines deployed in manufacturing environments, with emphasis on robustness to noise, missing modalities, and out-of-distribution behavior. Focus on bridging research-grade models with real production constraints.

Production ML Reliability Data-Centric AI
Candidly: Anti-Cheat Interview Platform Project

A comprehensive platform ensuring interview integrity through gaze and typing pattern detection, combined with a video interviewing tool for candidates and a Databricks-style analytics dashboard for interviewers. Features session recording, anomaly flagging, and insights on interviewer performance.

Devpost · Computer Vision Interview Analytics Full-Stack
World Models for Robotics Research

Predictive representations for embodied agents, bridging generative modeling with real-time inference constraints in closed-loop control. Explores latent dynamics learning and planning in compressed state spaces.

Representation Learning Latent Dynamics Hardware-Aware

Generates 3D LEGO models and step-by-step role-playing instructions from input images using modern generative models and structured outputs. Demonstrates human-centered AI applications.

Devpost · 3D Generation Structured Outputs

A semantic-based matching algorithm built on AWS primitives (Lambda + embeddings + DynamoDB) developed under hackathon constraints. Demonstrates rapid prototyping with cloud services.

Devpost · Embeddings AWS

Sequence-to-sequence encoder–decoder implementation for translation tasks with LSTM bottleneck. Includes discussion of attention mechanisms as next iteration. Fully documented with interactive Colab notebook.

GitHub · Seq2Seq NLP

OpenCV-based real-time pendulum tracking with downstream analysis of period and damping behavior. Demonstrates integration of computer vision with system identification.

GitHub · Computer Vision Dynamics

NeuroEvolution of Augmenting Topologies (NEAT) implementations including self-play Pong demo and evolved Smash Bros agents. Demonstrates evolutionary optimization for agent design.

GitHub · Evolutionary Algorithms Game AI
Teaching & Mentorship

I enjoy teaching and building technical communities around tools and research:

  • Ridley College (2026): Tutoring in LaTeX and advanced mathematics. LaTeX Notes.
  • University of Toronto Machine Intelligence Student Team (2025–2026): Facilitating paper-reading sessions on state-of-the-art research.
Resources

Public Drive

Research notes, datasets, and materials

Access Drive →

Microfluidic Tool

Interactive visualization and simulation tool

Open Tool →