Show HN: Autonomous recovery for distributed training jobs

(docs.tensorpool.dev)

9 points | by tsvoboda 2 days ago ago

3 comments