I am a computer architect doing research on memory systems and domain-specific hardware.
Recent Projects at Alibaba DAMO:
Large-scale distributed GNN Training with Cloud FPGA:
I was the chief architect. A cloud- and software-compatible 4-FPGA prototype was delivered. Paper under submission.
Near-Data Processing for Recommendation with 3D hybrid bonding DRAM:
I was a major contributor (2nd author). Test chip presented at ISSCC 2022.
Previous Projects at UCSB:
Processing-In-Memory (PIM) and Near Data Processing (NDP) Architecture:
PIM/NDP with DRAM and emerging NVM for applications such as deep learning and bioinformatics. The pioneering work PRIME has 1000+ citations. Publications in ISCA'16, DAC'16, MICRO'17/18/19, IEDM'17, HPCA'20, etc.
Memory Subsystem Optimization for Big Data Applications:
Memory optimizations for (dynamic) graph analytics, persistent databases, blockchain, and homomorphic encryption. Publications in DAC'18, CAL'18, MICRO'18, HPCA'19, MICRO'19, etc.
Non-von Neumann Architecture for Deep Neural Network:
Algorithm-architecture co-design. Publications in ISCA'16, MICRO'16, TPDS'18, ASPLOS'19, MICRO'20, etc.
Non-volatile Processor Architecture and Chip Design for IoT:
HPCA Best Paper; IEEE Micro Top Picks 2016.