Export Citations
1 Results for: Keyword: Queuing-Based model
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
Searched The ACM Guide to Computing Literature (3,836,434 records)|Limit your search to The ACM Full-Text Collection (774,263 records)
- research-articleAugust 2024
DeInfer: A GPU resource allocation algorithm with spatial sharing for near-deterministic inferring tasks
ICPP '24: Proceedings of the 53rd International Conference on Parallel ProcessingPages 701–711https://doi.org/10.1145/3673038.3673091For the applications of artificial intelligence, training models with GPUs are widely noted, while inferring requirements are somehow neglected. In some scenario, it is quite important to finish the deep learning inference (DLI) task and get the ...