π
SpecInfer: Accelerating Large Language Model Serving with Tree-based Speculative Inference and Verification paper by Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Zhengxin Zhang, Rae Ying Yee Wong, Alan Zhu, Lijie Yang, Xiaoxiang Shi, Chunan Shi, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia
β 59.8
π¬Technical Deep Dive
Full Specifications [+]
π¦Data Source: semantic_scholar
π Updated dailySource summary: Based on semantic_scholar metadata. Not a recommendation.
π‘οΈ Paper Transparency Report
Technical metadata sourced from upstream repositories.
Open Metadata
π Identity & Source
- id
- 2305.09781
- slug
- 2305.09781
- source
- semantic_scholar
- author
- Xupeng Miao, Gabriele Oliaro, Zhihao Zhang, Xinhao Cheng, Zeyu Wang, Zhengxin Zhang, Rae Ying Yee Wong, Alan Zhu, Lijie Yang, Xiaoxiang Shi, Chunan Shi, Zhuoming Chen, Daiyaan Arfeen, Reyna Abhyankar, Zhihao Jia
- license
- ArXiv
- tags
- paper, research, academic
βοΈ Technical Specs
- architecture
- null
- params billions
- null
- context length
- null
- pipeline tag
π Engagement & Metrics
- downloads
- 0
- stars
- 0
- forks
- 0
- citations
- 270
Data indexed from public sources. Updated daily.