Optimizing Latency-Sensitive AI Applications Through Edge-Cloud Collaboration

Authors

  • Jiang Wu Computer Science, University of Southern California, Los Angeles, CA, USA Author
  • Hongbo Wang Computer Science, University of Southern California, Los Angeles, CA, USA Author
  • Kun Qian Business Intelligence, Engineering School of Information and Digital Technologies, Villejuif,  France Author
  • Enmiao Feng Electrical & Computer Engineering, Duke University, NC, USA Author

DOI:

https://doi.org/10.69987/JACS.2023.30303

Keywords:

Edge-cloud collaboration, Latency optimization, Adaptive workload partitioning, Resource allocation

Abstract

This paper presents a novel framework for optimizing latency-sensitive AI applications through intelligent edge-cloud collaboration. The proposed approach addresses critical challenges in deploying computationally intensive AI workloads across distributed computing environments while meeting stringent timing requirements. The framework introduces an adaptive workload partitioning mechanism that dynamically distributes computational tasks based on application-specific latency requirements, resource availability, and network conditions. A comprehensive resource allocation strategy optimizes utilization across the computing continuum through specialized scheduling algorithms that prioritize time-sensitive operations. Communication protocol optimizations reduce data transfer overhead through context-aware compression techniques and adaptive packet sizing. Experimental evaluation conducted across heterogeneous computing environments demonstrates significant performance improvements, achieving latency reductions of 50-62% compared to baseline approaches. Resource utilization patterns show increased edge resource efficiency (83.4%) while reducing cloud resource consumption (31.1%). Energy efficiency metrics indicate substantial improvements across application categories, with energy-per-transaction reductions ranging from 50.0% to 60.6%. The framework maintains performance standards under challenging operational conditions, including network congestion and limited resource availability, validating its applicability for real-world deployment scenarios. The results demonstrate that intelligent edge-cloud collaboration can significantly enhance performance for latency-sensitive AI applications while improving overall system efficiency.

Author Biography

  • Enmiao Feng, Electrical & Computer Engineering, Duke University, NC, USA

     

     

Downloads

Published

2023-03-12

How to Cite

Jiang Wu, Hongbo Wang, Kun Qian, & Enmiao Feng. (2023). Optimizing Latency-Sensitive AI Applications Through Edge-Cloud Collaboration. Journal of Advanced Computing Systems , 3(3), 19-33. https://doi.org/10.69987/JACS.2023.30303

Share