Steve

Sign in

Cloud jobs · region SR006

13 / 32
our GPUs used
6
active jobs
19
free in quota
368
free in cluster

Our active jobs by instance:

  • 1 × 8-GPU a100plus.8gpu.80vG.96C.768G → 8 GPU
  • 3 × 1-GPU a100plus.1gpu.80vG.12C.182G → 3 GPU
  • 2 × 1-GPU a100plus.1gpu.80vG.12C.96G → 2 GPU
Cluster availability — 368 GPUs free in SR006 · updated 02:36:45 MSK
Flavor Instances free GPU/inst Total GPU free
8 GPU H100(A100+) 80 GB, 96 CPU, 1952 Gb RAM
a100plus.8gpu.80vG.96C.1952G
2 8 16
7 GPU H100(A100+) 80 GB, 84 CPU, 1708 Gb RAM
a100plus.7gpu.80vG.84C.1708G
5 7 35
6 GPU H100(A100+) 80 GB, 72 CPU, 1464 Gb RAM
a100plus.6gpu.80vG.72C.1464G
8 6 48
5 GPU H100(A100+) 80 GB, 60 CPU, 1220 Gb RAM
a100plus.5gpu.80vG.60C.1220G
9 5 45
4 GPU H100(A100+) 80 GB, 48 CPU, 976 Gb RAM
a100plus.4gpu.80vG.48C.976G
11 4 44
3 GPU H100(A100+) 80 GB, 36 CPU, 732 Gb RAM
a100plus.3gpu.80vG.36C.732G
18 3 54
2 GPU H100(A100+) 80 GB, 24 CPU, 488 Gb RAM
a100plus.2gpu.80vG.24C.488G
30 2 60
1 GPU H100(A100+) 80 GB, 12 CPU, 244 Gb RAM
a100plus.1gpu.80vG.12C.244G
66 1 66

Cluster-wide capacity, shared with all workspaces — our quota (32 GPU) is a separate ceiling not exposed by the API.

Window: last 7 days · 1d · 7d · 30d

Job Status GPU Instance Created Duration
lm-mpi-job-833cb38e-6e51-4063-a51a-fae768542145
hydra torch211 idle worker — NFS RPC interactive
✗ Failed a100plus.1gpu.80vG.12C.96G 2026-05-11 15:28 MSK 8.3 h
lm-mpi-job-a60eccc3-25c1-4bc2-8056-d46d28503cbc
hydra-test idle worker
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-11 13:02 MSK 40 min
lm-mpi-job-02b50ecf-3e75-45dd-9a8c-0afc27869d29
test #gradmem/iltiakov
✓ Completed a100plus.1gpu.80vG.12C.96G 2026-05-11 02:45 MSK 10.4 h
lm-mpi-job-93a4bd93-0204-4ddd-9b26-ba689bb29611
test #gradmem/iltiakov
✓ Completed a100plus.1gpu.80vG.12C.96G 2026-05-11 02:39 MSK 7.1 h
lm-mpi-job-b843ff96-b8a3-4bc0-8d30-dcca8cd642d0
test #gradmem/iltiakov
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-11 02:34 MSK 11.2 h
lm-mpi-job-ab366744-cef0-4337-a580-8a70bdbe6c2d
test #gradmem/iltiakov
✓ Completed a100plus.1gpu.80vG.12C.96G 2026-05-11 02:14 MSK 5.7 h
lm-mpi-job-edd02bcb-8527-4b2c-a13e-9ef63d27da1f
test #gradmem/iltiakov
✓ Completed a100plus.1gpu.80vG.12C.96G 2026-05-11 01:31 MSK 8.6 h
lm-mpi-job-2e89fe08-5aa5-4c99-a200-538dc518f342
test #gradmem/iltiakov
✓ Completed a100plus.1gpu.80vG.12C.96G 2026-05-11 01:31 MSK 5.9 h
lm-mpi-job-b0ec105b-d385-406a-9d07-d16c9f5b9c3a
alatyshev:cosmos_reason_best8_ar
Running a100plus.1gpu.80vG.12C.182G 2026-05-11 00:31 MSK 26.1 h (running)
lm-mpi-job-9df3d45e-c8e0-45e6-b668-7b2e7e5564a6
alatyshev:cosmos_reason_best8_ar
✗ Failed a100plus.1gpu.80vG.12C.182G 2026-05-11 00:29 MSK 1 min
lm-mpi-job-45f7078d-81bd-45f5-8915-0eac947bb446
alatyshev:cosmos_reason_best8
Running a100plus.1gpu.80vG.12C.182G 2026-05-11 00:28 MSK 26.2 h (running)
lm-mpi-job-39dccc1b-8afc-460b-a970-e55356f7aa4c
alatyshev:cosmos_reason_best8
✗ Failed a100plus.1gpu.80vG.12C.182G 2026-05-10 22:02 MSK 18 min
lm-mpi-job-5c1cd396-e1f0-4e54-9bb2-9e87d322e612
alatyshev:cosmos_reason_best8_ar
✗ Failed a100plus.1gpu.80vG.12C.182G 2026-05-10 22:00 MSK 21 min
lm-mpi-job-87259872-d972-4ed3-9e93-5b1fe3350b84
alatyshev:cosmos_reason_best1
✗ Failed a100plus.1gpu.80vG.12C.182G 2026-05-10 22:00 MSK 17 min
lm-mpi-job-758072e7-bf0a-4b7a-a405-ab59bdab0901
test #gradmem/iltiakov
✓ Completed a100plus.1gpu.80vG.12C.96G 2026-05-10 21:21 MSK 13.1 h
lm-mpi-job-857ee1e4-6878-4637-af5b-d1a6a2073555
test #gradmem/iltiakov
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-10 20:10 MSK 6.1 h
lm-mpi-job-9c84fe67-eb22-4296-9717-e452f3193ae1
test #gradmem/iltiakov
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-10 19:51 MSK 20 min
lm-mpi-job-4a522162-1142-45cc-a36c-b3eaf3dd075d
test #gradmem/iltiakov
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-10 19:30 MSK 6.7 h
lm-mpi-job-ff0913d4-1d1f-4f14-a4b4-e06e6e5b11eb
alatyshev:cosmos
✗ Stopped a100plus.1gpu.80vG.12C.182G 2026-05-10 19:28 MSK 2.6 h
lm-mpi-job-d61f5ad3-91cb-4de7-9e4c-62d89790c153
alatyshev:cosmos
✗ Failed a100plus.1gpu.80vG.12C.182G 2026-05-10 19:26 MSK 1 min
lm-mpi-job-dc9e44af-401e-4156-9492-89dcad840afa
test #gradmem/iltiakov
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-10 19:09 MSK 7.1 h
lm-mpi-job-64c0888d-3af2-4818-bb56-b009b30ff8f5
alatyshev:eval_cosmos_ar_val
Running a100plus.1gpu.80vG.12C.182G 2026-05-10 16:44 MSK 33.9 h (running)
lm-mpi-job-75c2315b-a2a6-4468-a201-7a030dba36ec
alatyshev:eval_cosmos_ar_val
✗ Failed a100plus.1gpu.80vG.12C.182G 2026-05-10 16:33 MSK 4 min
lm-mpi-job-b4161fd9-d9bb-4c62-9b80-5aba192a8a55
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-09 02:06 MSK 1.6 h
lm-mpi-job-c26ad29f-1eb3-4bdc-aac6-dc6a7e4f3499
phase2 cuda-lns v2 pilot postpull (HEAD 0417c8c) — 5 ids reproduce
✓ Completed a100plus.1gpu.80vG.12C.244G 2026-05-09 00:40 MSK 1.2 h
lm-mpi-job-880af590-513b-446b-bc33-8cac4f32851f
phase2 cuda-lns v2 pilot postpull (HEAD 0417c8c) — 5 ids reproduce
✗ Stopped a100plus.1gpu.80vG.12C.244G 2026-05-09 00:26 MSK 13 min
lm-mpi-job-acce7718-8ff7-441d-9272-94d5c30b9ce2
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 19:04 MSK 8.6 h
lm-mpi-job-a50e9dde-0227-4964-af7d-5ffa6069fa8e
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 19:04 MSK 8.6 h
lm-mpi-job-caa172f4-c7e8-4b71-9f7b-edc833f4d2b9
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 19:03 MSK 8.6 h
lm-mpi-job-d42a3b4b-a41d-453a-824f-f5b87200937b
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 19:03 MSK 6.9 h
lm-mpi-job-548619d3-637a-4c03-916a-1ae23c79f4a8
nsorokin-dev-2-gpu
✓ Completed a100plus.2gpu.80vG.24C.488G 2026-05-08 17:36 MSK 3.0 h
lm-mpi-job-48c3893e-96fc-49b9-88dd-4a03f04ddf5c
nsorokin-dev-2-gpu
✓ Completed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:48 MSK 3.0 h
lm-mpi-job-7453277e-8cc1-4246-b09f-42cf132719cd
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:45 MSK 2 min
lm-mpi-job-b7948709-5cc0-4208-bb9b-2783cbd593cd
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:45 MSK 2 min
lm-mpi-job-304b8bb2-36dc-43ce-9666-6a957bb4764d
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:45 MSK 2 min
lm-mpi-job-c2bd1a28-9bae-40f3-8f05-64842600b721
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:45 MSK 2 min
lm-mpi-job-3323eedf-e558-4479-a868-df725bfd0953
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:40 MSK 2 min
lm-mpi-job-9afc0b22-8aaf-48db-8f80-1fd2199050f7
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:40 MSK 2 min
lm-mpi-job-6b436438-0b60-421e-975a-a29ca1793634
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:40 MSK 2 min
lm-mpi-job-2e161ffa-1233-4e46-9508-3f6f522c8f1d
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:40 MSK 2 min
lm-mpi-job-5cb68ee5-b986-49db-969a-a824ea4921d1
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:30 MSK 1 min
lm-mpi-job-29025e54-3b0c-4efa-aff4-ee0f0018794b
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:30 MSK 1 min
lm-mpi-job-79cc3a86-0312-4921-9d24-043e3139101f
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:30 MSK 1 min
lm-mpi-job-8eb576e6-71c7-4597-b298-9860f603c234
nsorokin-agent_craftext-eval
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-08 13:30 MSK 1 min
lm-mpi-job-b64586c5-f76b-47ac-8d3c-c8554d31a016
phase2 v2 cuda-lns persistent pilot 5
✓ Completed a100plus.1gpu.80vG.12C.244G 2026-05-08 00:09 MSK 26 min
lm-mpi-job-25b1f14c-850c-49be-a2bf-769bd0c14a3f
phase2 v2 cuda-lns persistent pilot 5
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-07 23:32 MSK 1 min
lm-mpi-job-1009f05e-9d85-4421-8601-4a5de0626f04
nsorokin-agent_craftext-train
✓ Completed a100plus.2gpu.80vG.24C.488G 2026-05-07 19:23 MSK 40.0 h
lm-mpi-job-0dfdc274-81d6-4c3f-aa99-a16cf5e42785
nsorokin-agent_craftext-train
✓ Completed a100plus.2gpu.80vG.24C.488G 2026-05-07 19:23 MSK 39.8 h
lm-mpi-job-a1b93be7-55ff-44c6-a2c8-e229afbdf42d
nsorokin-agent_craftext-train
✓ Completed a100plus.2gpu.80vG.24C.488G 2026-05-07 19:23 MSK 39.9 h
lm-mpi-job-7893d6d4-cffd-4f3b-b4a3-97037038f93e
nsorokin-agent_craftext-train
✓ Completed a100plus.2gpu.80vG.24C.488G 2026-05-07 19:23 MSK 35.4 h
lm-mpi-job-eef6c185-bf85-4f28-b9b0-edd5359ee93e
nsorokin-dev-2-gpu
✓ Completed a100plus.2gpu.80vG.24C.488G 2026-05-07 15:49 MSK 5.5 h
lm-mpi-job-0a506c19-c2e2-4ecd-b263-bd87f597ae18
alatyshev:eval_cosmos
✗ Stopped cpu.8C.32G 2026-05-07 15:23 MSK 51 min
lm-mpi-job-27c8e824-606b-4a54-bab1-ce7861f0fd18
alatyshev:eval_cosmos_2_vneg10
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-07 15:16 MSK 58 min
lm-mpi-job-ea0dc8b1-3b84-4b85-b0cb-59fad032157c
zvolovikova-craftext-minimal
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-06 17:25 MSK 4.6 h
lm-mpi-job-e5370c30-0b51-4ded-95ec-ebd042517765
zvolovikova-craftext-minimal
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-06 17:25 MSK 3.8 h
lm-mpi-job-d6ebb5a9-cd4e-4986-8818-022adfa4c39a
nsorokin-agent_crafter-oracle-balrog-base-cot-train
✗ Failed a100plus.4gpu.80vG.48C.976G 2026-05-06 14:52 MSK 12.7 h
lm-mpi-job-e4e502a7-48b7-452b-96b7-0d8ad203b6d0
nsorokin-agent_crafter-balrog-min-cot-train
✗ Failed a100plus.2gpu.80vG.24C.488G 2026-05-06 14:43 MSK 5.0 h
lm-mpi-job-e57d5800-e785-4884-a63a-43227cbbd03d
zvolovikova-craftext-minimal-gspo
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-05 14:58 MSK 19.3 h
lm-mpi-job-25102c01-72d6-498f-b2d2-187c8219a7ea
zvolovikova-craftext-minimal-gspo
✗ Stopped a100plus.1gpu.80vG.12C.244G 2026-05-05 14:57 MSK 20.0 h
lm-mpi-job-74554e98-4953-4f8e-a76a-5b7c1ea2350a
zvolovikova-craftext-minimal-gspo
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-05 14:56 MSK 18.3 h
lm-mpi-job-b96b58f8-ce2f-4672-b801-76948d30fad5
zvolovikova-craftext-minimal-gspo
✗ Stopped a100plus.1gpu.80vG.12C.244G 2026-05-05 14:01 MSK 21.0 h
lm-mpi-job-e57f27f7-0c2d-4681-82c2-38916935affa
zvolovikova-craftext-minimal-gspo
✗ Stopped a100plus.1gpu.80vG.12C.244G 2026-05-05 13:55 MSK 6 min
lm-mpi-job-94455db2-fa76-45ba-9887-756d7c69c6c0
zvolovikova-craftext-minimal-gspo
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-05 13:18 MSK 6.7 h
lm-mpi-job-df0d659c-d5f8-45b2-8e01-031587b226a2
zvolovikova-craftext-minimal-gspo
✗ Stopped a100plus.1gpu.80vG.12C.244G 2026-05-05 13:18 MSK 1 min
lm-mpi-job-d58d7897-7bd8-4b2f-a671-6a119023d962
zvolovikova-craftext-minimal-gspo
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-05 12:54 MSK 16 min
lm-mpi-job-fd771f2f-3240-4be9-b46f-20cbd719da37
zvolovikova-craftext-minimal-gspo
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-05 12:38 MSK 13 min
lm-mpi-job-77cde842-94c3-42f7-9522-a8467ed2fd12
zvolovikova-craftext-minimal-gspo
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-05 12:20 MSK 12 min
lm-mpi-job-f4503af0-5add-4a66-b68b-a2d4ca190ac3
zvolovikova-craftext-minimal-gspo
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-05 12:19 MSK 1 min
lm-mpi-job-4f945447-7ff3-46d4-9c24-58ae5ea83f36
zvolovikova-craftext-minimal-gspo
✗ Failed a100plus.1gpu.80vG.12C.244G 2026-05-05 12:15 MSK 1 min
lm-mpi-job-a9c9695d-3471-4c1f-b918-1816b0256bb5
phase2 cuda-lns new 777
✓ Completed a100plus.1gpu.80vG.12C.244G 2026-05-05 11:31 MSK 41.3 h
lm-mpi-job-31071f3e-0f72-42cc-8466-5204d4e7b481
CogAI-ugadiarov.1gpu
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-05 09:57 MSK 55.9 h
lm-mpi-job-148a6f44-eb37-4deb-8ee1-25adbfc07f66
CogAI-ugadiarov.1gpu
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-05 08:49 MSK 57.0 h
lm-mpi-job-b4956565-3242-4e31-85a5-b68af60f9b90
CogAI-ugadiarov.1gpu
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-05 08:49 MSK 1.2 h
lm-mpi-job-d852cffd-7d55-4d9f-951e-5eaf3dc4eeba
CogAI-ugadiarov.1gpu
✗ Stopped a100plus.1gpu.80vG.12C.96G 2026-05-05 08:49 MSK 57.0 h