DOI: 10.1145/3662006
EdgeFM '24: Proceedings of the Workshop on Edge and Mobile Foundation Models
ACM2024 Proceeding
Publisher:
Association for Computing Machinery, New York, NY, United States
Conference:
MOBISYS '24: The 22nd Annual International Conference on Mobile Systems, Applications and Services, Minato-ku, Tokyo, Japan, June 3-7, 2024
ISBN:
979-8-4007-0663-9
Published:
11 June 2024
Abstract

No abstract available.

research-article
Free
Large Language Models on Mobile Devices: Measurements, Analysis, and Insights

Deploying large language model (LLM) inference on mobile devices is cost-efficient for companies and addresses users' privacy concerns. However, the limited computation capacity and memory constraints of mobile devices hinder their ...

short-paper
Open Access
WiP: An On-device LLM-based Approach to Query Privacy Protection

Privacy leakage from user queries is a widespread concern in search engines and chatbot services. Existing solutions based on privacy information removal, obfuscation, and encryption may inevitably hurt service quality or require full trust of the ...

research-article
Free
Towards a Task-agnostic Distillation Methodology for Creating Edge Foundation Models

In recent years, AI has undergone significant changes. First, there is a growing recognition of the need to deploy inference models based on Deep Neural Networks (DNNs) on edge devices. Second, there is an increasing demand for low-energy inferencing ...

short-paper
Free
WiP: A Solution for Reducing MLLM-Based Agent Interaction Overhead

Current multi-modal LLM-based mobile agents raise concerns over high inference time and cost. We propose to tackle these issues by developing a lightweight UI Transition Graph (UTG) and locally executing automatic tasks. Specifically, we ...

research-article
Open Access
ChainStream: A Stream-based LLM Agent Framework for Continuous Context Sensing and Sharing

This paper introduces ChainStream, an LLM-based framework for building and serving context-aware AI agents. Driven by the goal to enable context awareness of LLM agents and flexible information sharing between them, we adopt a stream-based design, in ...

research-article
Open Access
Are Large Language Models Capable of Causal Reasoning for Sensing Data Analysis?

The correlation analysis between socioeconomic factors and environmental impact is essential for policy making to ensure sustainability and economic development simultaneously. With the development of Internet of Things (IoT), citizen science IoT ...

short-paper
Open Access
WiP: Towards Light Adaptation of Large Language Models For Personal Hardware

The large language models (LLMs) in widespread use are not deployed locally, so users must send relatively private and important data to the LLM when using it. Handing over such data causes concern, especially now ...

short-paper
Free
WiP: Efficient LLM Prefilling with Mobile NPU

Large language models (LLMs) play a crucial role in various Natural Language Processing (NLP) tasks, prompting their deployment on mobile devices for inference. However, a significant challenge arises due to high waiting latency, especially for long ...

research-article
Open Access
Hybrid SLM and LLM for Edge-Cloud Collaborative Inference

Edge-Cloud collaboration for deep learning inference has been actively studied to enhance inference performance by leveraging both Edge and Cloud resources. However, traditional Edge-Cloud collaboration based on model partitioning or confidence ...
