π
DeepSpeed- Inference: Enabling Efficient Inference of Transformer Models at Unprecedented Scale paper by Reza Yazdani Aminabadi, Samyam Rajbhandari, Minjia Zhang, A. A. Awan, Cheng Li, Du Li, Elton Zheng, Jeff Rasley, Shaden Smith, Olatunji Ruwase, Yuxiong He
β 61.1
π¬Technical Deep Dive
Full Specifications [+]
π¦Data Source: semantic_scholar
π Updated dailySource summary: Based on semantic_scholar metadata. Not a recommendation.
π‘οΈ Paper Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
π Identity & Source
- id
- 2207.00032
- slug
- 2207.00032
- source
- semantic_scholar
- author
- Reza Yazdani Aminabadi, Samyam Rajbhandari, Minjia Zhang, A. A. Awan, Cheng Li, Du Li, Elton Zheng, Jeff Rasley, Shaden Smith, Olatunji Ruwase, Yuxiong He
- license
- ArXiv
- tags
- paper, research, academic
βοΈ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
π Engagement & Metrics
- downloads
- 0
- stars
- 0
- forks
- 0
- citations
- 518
Data indexed from public sources. Updated daily.