TY  - JOUR
AU  - Liu, Gaosheng
AU  - Yıldırım, Kasım Sinan
AU  - Wang, Lin
ID  - 61775
JF  - IEEE Transactions on Mobile Computing
SN  - 1536-1233
TI  - FreeBeacon: Efficient Communication and Data Aggregation in Battery-Free IoT
ER  - 
TY  - CONF
AU  - Apostolo, Guilherme Henrique
AU  - Bauszat, Pablo
AU  - Nigade, Vinod
AU  - Bal, Henri E.
AU  - Wang, Lin
ID  - 63054
T2  - Proceedings of the 31st Annual International Conference on Mobile Computing and Networking (MobiCom)
TI  - Uirapuru: Timely Video Analytics for High-Resolution Steerable Cameras on Edge Devices
ER  - 
TY  - JOUR
AU  - Pei, Qiangyu
AU  - Yuan, Yongjie
AU  - Hu, Haichuan
AU  - Wang, Lin
AU  - Zhang, Dong
AU  - Yan, Bingheng
AU  - Yu, Chen
AU  - Liu, Fangming
ID  - 63057
IS  - 4
JF  - IEEE Transactions on Sustainable Computing
SN  - 2377-3782
TI  - Working Smarter Not Harder: Hybrid Cooling for Deep Learning in Edge Datacenters
VL  - 10
ER  - 
TY  - CONF
AU  - Wu, Jing
AU  - Wang, Lin
AU  - Deng, Quanfeng
AU  - Yu, Chen
AU  - Zhang, Dong
AU  - Yan, Bingheng
AU  - Liu, Fangming
ID  - 63056
T2  - 2025 IEEE International Parallel and Distributed Processing Symposium (IPDPS)
TI  - It Takes Two to Tango: Serverless Workflow Serving via Bilaterally Engaged Resource Adaptation
ER  - 
TY  - CONF
AU  - Illian, Marvin
AU  - Luchterhandt, Björn
AU  - Wang, Lin
ID  - 61256
T2  - Proceedings of the 20th Workshop on Mobility in the Evolving Internet Architecture (MobiArch)
TI  - Band Switching for Mobile Energy Optimization in 5G Networks and Beyond
ER  - 
TY  - CONF
AU  - Ghafouri, Saeid
AU  - Razavi, Kamran
AU  - Salmani, Mehran
AU  - Sanaee, Alireza
AU  - Botran, Tania Lorido
AU  - Wang, Lin
AU  - Doyle, Joseph
AU  - Jamshidi, Pooyan
ID  - 63058
T2  - Companion of the 16th ACM/SPEC International Conference on Performance Engineering
TI  - IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency
ER  - 
TY  - CONF
AU  - Hu, Haichuan
AU  - Liu, Fangming
AU  - Pei, Qiangyu
AU  - Yuan, Yongjie
AU  - Xu, Zichen
AU  - Wang, Lin
ID  - 50807
T2  - Proceedings of the ACM Web Conference (WWW)
TI  - 𝜆Grapher: A Resource-Efficient Serverless System for GNN Serving through Graph Sharing
ER  - 
TY  - JOUR
AU  - Ghafouri, Saeid
AU  - Razavi, Kamran
AU  - Salmani, Mehran
AU  - Sanaee, Alireza
AU  - Lorido Botran, Tania 
AU  - Wang, Lin
AU  - Doyle, Joseph
AU  - Jamshidi, Pooyan
ID  - 53531
JF  - Journal of Systems Research (JSys)
TI  - IPA: Inference Pipeline Adaptation to Achieve High Accuracy and Cost-Efficiency
ER  - 
TY  - CONF
AU  - Razavi, Kamran
AU  - Davari Fard, Shayan
AU  - Karlos, George
AU  - Nigade, Vinod
AU  - Mühlhäuser, Max
AU  - Wang, Lin
ID  - 55365
T2  - Proceedings of the IEEE International Symposium on Computers and Communications (ISCC)
TI  - NetNN: Neural Intrusion Detection System in Programmable Networks (Second Best Paper Award)
ER  - 
TY  - CONF
AU  - Razavi, Kamran
AU  - Ghafouri, Saeid
AU  - Mühlhäuser, Max
AU  - Jamshidi, Pooyan
AU  - Wang, Lin
ID  - 53095
T2  - Proceedings of the 4th Workshop on Machine Learning and Systems (EuroMLSys), colocated with EuroSys 2024
TI  - Sponge: Inference Serving with Dynamic SLOs Using In-Place Vertical Scaling
ER  - 
TY  - CONF
AU  - Dou, Feng
AU  - Wang, Lin
AU  - Chen, Shutong
AU  - Liu, Fangming
ID  - 50066
T2  - Proceedings of the IEEE International Conference on Computer Communications (INFOCOM)
TI  - X-Stream: A Flexible, Adaptive Video Transformer for Privacy-Preserving Video Stream Analytics
ER  - 
TY  - CONF
AU  - Liu, Gaosheng
AU  - Nigade, Vinod
AU  - Bal, Henri
AU  - Wang, Lin
ID  - 53807
T2  - Proceedings of the 8th ACM Asia Pacific Workshop on Networking (APNET)
TI  - A Little Certainty is All We Need: Discovery and Synchronization Acceleration in Battery-Free IoT
ER  - 
TY  - CONF
AU  - Karlos, George
AU  - Bal, Henri
AU  - Wang, Lin
ID  - 55366
T2  - Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis (SC)
TI  - NetCL: A Unified Programming Framework for In-Network Computing
ER  - 
TY  - JOUR
AU  - Liu, Gaosheng
AU  - Wang, Lin
ID  - 55364
JF  - IEEE Transactions on Mobile Computing (TMC)
TI  - Data On the Go: Seamless Data Routing for Intermittently-Powered Battery-Free Sensing
ER  - 
TY  - CONF
AU  - Blöcher, Marcel
AU  - Nedderhut, Nils
AU  - Chuprikov, Pavel
AU  - Khalili, Ramin
AU  - Eugster, Patrick
AU  - Wang, Lin
ID  - 50065
T2  - Proceedings of the IEEE International Conference on Computer Communications (INFOCOM)
TI  - Train Once Apply Anywhere: Effective Scheduling for Network Function Chains Running on FUMES
ER  - 
TY  - CONF
AU  - Pei, Qiangyu
AU  - Wang, Lin
AU  - Zhang, Dong
AU  - Yan, Bingheng
AU  - Yu, Chen
AU  - Liu, Fangming
ID  - 56689
T2  - Proceedings of the 15th ACM Symposium on Cloud Computing (SoCC)
TI  - InferCool: Enhancing AI Inference Cooling through Transparent, Non-Intrusive Task Reassignment
ER  - 
TY  - JOUR
AU  - Hu, Jiahai
AU  - Wang, Lin
AU  - Wu, Jing
AU  - Pei, Qiangyu
AU  - Liu, Fangming
AU  - Li, Bo
ID  - 59074
JF  - Computer Networks
SN  - 1389-1286
TI  - A Comparative Measurement Study of Cross-Layer 5G Performance Under Different Mobility Scenarios
VL  - 257
ER  - 
TY  - JOUR
AB  - <jats:title>Abstract</jats:title><jats:p>While high accuracy is of paramount importance for deep learning (DL) inference, serving inference requests on time is equally critical but has not been carefully studied especially when the request has to be served over a dynamic wireless network at the edge. In this paper, we propose Jellyfish—a novel edge DL inference serving system that achieves soft guarantees for end-to-end inference latency service-level objectives (SLO). Jellyfish handles the network variability by utilizing both data and deep neural network (DNN) adaptation to conduct tradeoffs between accuracy and latency. Jellyfish features a new design that enables collective adaptation policies where the decisions for data and DNN adaptations are aligned and coordinated among multiple users with varying network conditions. We propose efficient algorithms to continuously map users and adapt DNNs at runtime, so that we fulfill latency SLOs while maximizing the overall inference accuracy. We further investigate <jats:italic>dynamic</jats:italic> DNNs, i.e., DNNs that encompass multiple architecture variants, and demonstrate their potential benefit through preliminary experiments. Our experiments based on a prototype implementation and real-world WiFi and LTE network traces show that Jellyfish can meet latency SLOs at around the 99th percentile while maintaining high accuracy.
</jats:p>
AU  - Nigade, Vinod
AU  - Bauszat, Pablo
AU  - Bal, Henri
AU  - Wang, Lin
ID  - 63059
IS  - 2
JF  - Real-Time Systems
SN  - 0922-6443
TI  - Inference serving with end-to-end latency SLOs over dynamic edge networks
VL  - 60
ER  - 
TY  - JOUR
AU  - Wu, Jing
AU  - Wang, Lin
AU  - Jin, Qirui
AU  - Liu, Fangming
ID  - 63060
IS  - 2
JF  - IEEE Transactions on Parallel and Distributed Systems
SN  - 1045-9219
TI  - Graft: Efficient Inference Serving for Hybrid Deep Learning With SLO Guarantees via DNN Re-Alignment
VL  - 35
ER  - 
TY  - THES
AU  - Schneider, Stefan Balthasar
ID  - 29672
TI  - Network and Service Coordination: Conventional and Machine Learning Approaches"
ER  -