News
Newest
Ask
Show
Jobs
Open on GitHub
Speculative pre-positioning: off-path decode for stateful inference sessions
(arxiv.org)
1 points | by
logotype
3 hours ago
1 comments
logotype
1 hour ago
With native support for sessions in an inference engine we can make use of idle GPUs... doing work!
1 comments