About
I am Raghu Raja1, currently a Principal Engineer in AWS (Annapurna Labs). I work on networking technologies for Machine Learning accelerators. Prior to this, I was an architect at Enfabrica - a stealth startup, where I was leading the Machine Learning software ecosystem development. Before that, I spent about four years with AWS HPC organization, where I was the Technical Lead for a team building software for the Elastic Fabric Adapter. I helped develop libfabric and was the maintainer for the EFA libfabric provider. Before AWS, I was a Senior Engineer in Cray’s Storage R&D organization, working on strategic path-finding projects (such as this) targeting future-generation supercomputers as well as tactical feature development for current-generation systems.
I went to The Ohio State University (Go Bucks!) for graduate school. My dissertation, advised by D. K. Panda and done in collaboration with Lawrence Livermore National Laboratory, covered the intersection of two key aspects of supercomputing systems — scalable networking and efficient parallel I/O.
While I no longer actively publish research papers2, you can find some of my academic publications here.
When not in front of a computer, I am likely being goofy with my son Avi and wife Aarthy.
Footnotes: