matlok's Collections
Papers - IoT
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
Paper
•
2402.14905
•
Published
•
127
Sensor-based Multi-Robot Search and Coverage with Spatial Separation in Unstructured Environments
Paper
•
2403.01710
•
Published
•
2
EdgeMoE: Fast On-Device Inference of MoE-based Large Language Models
Paper
•
2308.14352
•
Published
Slimmable Encoders for Flexible Split DNNs in Bandwidth and Resource Constrained IoT Systems
Paper
•
2306.12691
•
Published
•
2
Bias Loss for Mobile Neural Networks
Paper
•
2107.11170
•
Published
•
2
MicroNAS: Memory and Latency Constrained Hardware-Aware Neural Architecture Search for Time Series Classification on Microcontrollers
Paper
•
2310.18384
•
Published
•
2
Pattern Discovery in Time Series with Byte Pair Encoding
Paper
•
2106.00614
•
Published
•
2
Towards a World-English Language Model for On-Device Virtual Assistants
Paper
•
2403.18783
•
Published
•
4
Transformer-Lite: High-efficiency Deployment of Large Language Models on Mobile Phone GPUs
Paper
•
2403.20041
•
Published
•
34
Octopus v2: On-device language model for super agent
Paper
•
2404.01744
•
Published
•
57
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper
•
2312.11514
•
Published
•
257
Octopus v4: Graph of language models
Paper
•
2404.19296
•
Published
•
116