
Spring 2025 Seminar Series
Tuesday, 3/4 -Insights Gained From Delivering Two Generations of AI Supercomputing and Storage Solutions in IBM Cloud
Dr. Seetharami Seelam, Distinguished Engineer at IBM Research
AI Supercomputers in public clouds serve as crucial components in the swift and cost-effective creation and deployment of cutting-edge AI models. This heightened demand for potent cloud-native AI supercomputers stems from the increasing prevalence of generative AI and foundational models. In these systems, numerous GPUs collaborate to facilitate model training, optimization, and serve countless concurrent applications without disruption. To ensure optimal performance, reliability, and adaptability for various AI workloads, a comprehensive solution integrating hardware, software, and holistic telemetry is essential. This solution enables the efficient and high-performance execution of multiple AI workload types while maintaining resilience. In this talk, Dr. Seelam will discuss two generations of Vela cloud-native AI systems in IBM Cloud, which form the backbone of IBM's AI endeavors. He will explore the scaling, performance, and high availability challenges confronted during their development and operation.
Moody Hall Auditorium | 12:00 pm - 1:00 pm
Slides from Dr. Seelam's presentation can be found here.