RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Paper • 2408.02545 • Published Aug 5 • 35
Accelerating Speculative Decoding using Dynamic Speculation Length Paper • 2405.04304 • Published May 7 • 2
Distributed Speculative Inference of Large Language Models Paper • 2405.14105 • Published May 23 • 16
ABSApp: A Portable Weakly-Supervised Aspect-Based Sentiment Extraction System Paper • 1909.05608 • Published Sep 12, 2019
Term Set Expansion based NLP Architect by Intel AI Lab Paper • 1808.08953 • Published Aug 27, 2018 • 1
Term Set Expansion based on Multi-Context Term Embeddings: an End-to-end Workflow Paper • 1807.10104 • Published Jul 26, 2018 • 1