I am a computer architect doing research on memory systems and domain-specific hardware.

Recent Projects at Alibaba DAMO:

Large-scale distributed GNN Training with Cloud FPGA:
I was the chief architect. A cloud- and software-compatible 4-FPGA prototype was delivered. Paper under submission.

Near-Data Processing for Recommendation with 3D hybrid bonding DRAM:
I was a major contributor (2nd author). Test chip presented at ISSCC 2022.

Previous Projects at UCSB:

Processing-In-Memory (PIM) and Near-Data Processing (NDP) Architecture:
PIM/NDP architectures based on DRAM and emerging NVM for applications such as deep learning and bioinformatics. The pioneering work PRIME has 1000+ citations. Publications at ISCA'16, DAC'16, MICRO'17/18/19, IEDM'17, HPCA'20, etc.

Memory Subsystem Optimization for Big Data Applications:
Memory optimizations for (dynamic) graph analytics, persistent databases, blockchain, and homomorphic encryption. Publications at DAC'18, CAL'18, MICRO'18, HPCA'19, MICRO'19, etc.

Non-von Neumann Architecture for Deep Neural Network:
Algorithm-architecture co-design. Publications at ISCA'16, MICRO'16, TPDS'18, ASPLOS'19, MICRO'20, etc.

Non-volatile Processor Architecture and Chip Design for IoT:
HPCA Best Paper; Micro Top Pick 2016.