Cloud jobs · region SR006
13 / 32
our GPUs used
6
active jobs
19
free in quota
368
free in cluster
- 1 × 8-GPU a100plus.8gpu.80vG.96C.768G → 8 GPU
- 3 × 1-GPU a100plus.1gpu.80vG.12C.182G → 3 GPU
- 2 × 1-GPU a100plus.1gpu.80vG.12C.96G → 2 GPU
Cluster availability — 368 GPUs free in SR006 · updated 01:26:23 MSK
| Flavor | Instances free | GPU/inst | Total GPU free |
|---|---|---|---|
8 GPU H100(A100+) 80 GB, 96 CPU, 1952 Gb RAMa100plus.8gpu.80vG.96C.1952G |
2 | 8 | 16 |
7 GPU H100(A100+) 80 GB, 84 CPU, 1708 Gb RAMa100plus.7gpu.80vG.84C.1708G |
5 | 7 | 35 |
6 GPU H100(A100+) 80 GB, 72 CPU, 1464 Gb RAMa100plus.6gpu.80vG.72C.1464G |
8 | 6 | 48 |
5 GPU H100(A100+) 80 GB, 60 CPU, 1220 Gb RAMa100plus.5gpu.80vG.60C.1220G |
9 | 5 | 45 |
4 GPU H100(A100+) 80 GB, 48 CPU, 976 Gb RAMa100plus.4gpu.80vG.48C.976G |
11 | 4 | 44 |
3 GPU H100(A100+) 80 GB, 36 CPU, 732 Gb RAMa100plus.3gpu.80vG.36C.732G |
18 | 3 | 54 |
2 GPU H100(A100+) 80 GB, 24 CPU, 488 Gb RAMa100plus.2gpu.80vG.24C.488G |
30 | 2 | 60 |
1 GPU H100(A100+) 80 GB, 12 CPU, 244 Gb RAMa100plus.1gpu.80vG.12C.244G |
66 | 1 | 66 |
| Job | Status | GPU | Instance | Created | Duration |
|---|---|---|---|---|---|
lm-mpi-job-833cb38e-6e51-4063-a51a-fae768542145
hydra torch211 idle worker — NFS RPC interactive |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-11 15:28 MSK | 8.3 h |
lm-mpi-job-a60eccc3-25c1-4bc2-8056-d46d28503cbc
hydra-test idle worker |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-11 13:02 MSK | 40 min |
lm-mpi-job-02b50ecf-3e75-45dd-9a8c-0afc27869d29
test #gradmem/iltiakov |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-11 02:45 MSK | 10.4 h |
lm-mpi-job-93a4bd93-0204-4ddd-9b26-ba689bb29611
test #gradmem/iltiakov |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-11 02:39 MSK | 7.1 h |
lm-mpi-job-b843ff96-b8a3-4bc0-8d30-dcca8cd642d0
test #gradmem/iltiakov |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-11 02:34 MSK | 11.2 h |
lm-mpi-job-ab366744-cef0-4337-a580-8a70bdbe6c2d
test #gradmem/iltiakov |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-11 02:14 MSK | 5.7 h |
lm-mpi-job-edd02bcb-8527-4b2c-a13e-9ef63d27da1f
test #gradmem/iltiakov |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-11 01:31 MSK | 8.6 h |
lm-mpi-job-2e89fe08-5aa5-4c99-a200-538dc518f342
test #gradmem/iltiakov |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-11 01:31 MSK | 5.9 h |
lm-mpi-job-b0ec105b-d385-406a-9d07-d16c9f5b9c3a
alatyshev:cosmos_reason_best8_ar |
Running | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-11 00:31 MSK | 24.9 h (running) |
lm-mpi-job-9df3d45e-c8e0-45e6-b668-7b2e7e5564a6
alatyshev:cosmos_reason_best8_ar |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-11 00:29 MSK | 1 min |
lm-mpi-job-45f7078d-81bd-45f5-8915-0eac947bb446
alatyshev:cosmos_reason_best8 |
Running | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-11 00:28 MSK | 25.0 h (running) |
lm-mpi-job-39dccc1b-8afc-460b-a970-e55356f7aa4c
alatyshev:cosmos_reason_best8 |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-10 22:02 MSK | 18 min |
lm-mpi-job-5c1cd396-e1f0-4e54-9bb2-9e87d322e612
alatyshev:cosmos_reason_best8_ar |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-10 22:00 MSK | 21 min |
lm-mpi-job-87259872-d972-4ed3-9e93-5b1fe3350b84
alatyshev:cosmos_reason_best1 |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-10 22:00 MSK | 17 min |
lm-mpi-job-758072e7-bf0a-4b7a-a405-ab59bdab0901
test #gradmem/iltiakov |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-10 21:21 MSK | 13.1 h |
lm-mpi-job-857ee1e4-6878-4637-af5b-d1a6a2073555
test #gradmem/iltiakov |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-10 20:10 MSK | 6.1 h |
lm-mpi-job-9c84fe67-eb22-4296-9717-e452f3193ae1
test #gradmem/iltiakov |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-10 19:51 MSK | 20 min |
lm-mpi-job-4a522162-1142-45cc-a36c-b3eaf3dd075d
test #gradmem/iltiakov |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-10 19:30 MSK | 6.7 h |
lm-mpi-job-ff0913d4-1d1f-4f14-a4b4-e06e6e5b11eb
alatyshev:cosmos |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-10 19:28 MSK | 2.6 h |
lm-mpi-job-d61f5ad3-91cb-4de7-9e4c-62d89790c153
alatyshev:cosmos |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-10 19:26 MSK | 1 min |
lm-mpi-job-dc9e44af-401e-4156-9492-89dcad840afa
test #gradmem/iltiakov |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-10 19:09 MSK | 7.1 h |
lm-mpi-job-64c0888d-3af2-4818-bb56-b009b30ff8f5
alatyshev:eval_cosmos_ar_val |
Running | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-10 16:44 MSK | 32.7 h (running) |
lm-mpi-job-75c2315b-a2a6-4468-a201-7a030dba36ec
alatyshev:eval_cosmos_ar_val |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.182G | 2026-05-10 16:33 MSK | 4 min |
lm-mpi-job-b4161fd9-d9bb-4c62-9b80-5aba192a8a55
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-09 02:06 MSK | 1.6 h |
lm-mpi-job-c26ad29f-1eb3-4bdc-aac6-dc6a7e4f3499
phase2 cuda-lns v2 pilot postpull (HEAD 0417c8c) — 5 ids reproduce |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-09 00:40 MSK | 1.2 h |
lm-mpi-job-880af590-513b-446b-bc33-8cac4f32851f
phase2 cuda-lns v2 pilot postpull (HEAD 0417c8c) — 5 ids reproduce |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-09 00:26 MSK | 13 min |
lm-mpi-job-acce7718-8ff7-441d-9272-94d5c30b9ce2
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 19:04 MSK | 8.6 h |
lm-mpi-job-a50e9dde-0227-4964-af7d-5ffa6069fa8e
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 19:04 MSK | 8.6 h |
lm-mpi-job-caa172f4-c7e8-4b71-9f7b-edc833f4d2b9
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 19:03 MSK | 8.6 h |
lm-mpi-job-d42a3b4b-a41d-453a-824f-f5b87200937b
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 19:03 MSK | 6.9 h |
lm-mpi-job-548619d3-637a-4c03-916a-1ae23c79f4a8
nsorokin-dev-2-gpu |
✓ Completed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 17:36 MSK | 3.0 h |
lm-mpi-job-48c3893e-96fc-49b9-88dd-4a03f04ddf5c
nsorokin-dev-2-gpu |
✓ Completed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:48 MSK | 3.0 h |
lm-mpi-job-7453277e-8cc1-4246-b09f-42cf132719cd
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:45 MSK | 2 min |
lm-mpi-job-b7948709-5cc0-4208-bb9b-2783cbd593cd
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:45 MSK | 2 min |
lm-mpi-job-304b8bb2-36dc-43ce-9666-6a957bb4764d
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:45 MSK | 2 min |
lm-mpi-job-c2bd1a28-9bae-40f3-8f05-64842600b721
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:45 MSK | 2 min |
lm-mpi-job-3323eedf-e558-4479-a868-df725bfd0953
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:40 MSK | 2 min |
lm-mpi-job-9afc0b22-8aaf-48db-8f80-1fd2199050f7
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:40 MSK | 2 min |
lm-mpi-job-6b436438-0b60-421e-975a-a29ca1793634
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:40 MSK | 2 min |
lm-mpi-job-2e161ffa-1233-4e46-9508-3f6f522c8f1d
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:40 MSK | 2 min |
lm-mpi-job-5cb68ee5-b986-49db-969a-a824ea4921d1
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:30 MSK | 1 min |
lm-mpi-job-29025e54-3b0c-4efa-aff4-ee0f0018794b
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:30 MSK | 1 min |
lm-mpi-job-79cc3a86-0312-4921-9d24-043e3139101f
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:30 MSK | 1 min |
lm-mpi-job-8eb576e6-71c7-4597-b298-9860f603c234
nsorokin-agent_craftext-eval |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-08 13:30 MSK | 1 min |
lm-mpi-job-b64586c5-f76b-47ac-8d3c-c8554d31a016
phase2 v2 cuda-lns persistent pilot 5 |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-08 00:09 MSK | 26 min |
lm-mpi-job-25b1f14c-850c-49be-a2bf-769bd0c14a3f
phase2 v2 cuda-lns persistent pilot 5 |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-07 23:32 MSK | 1 min |
lm-mpi-job-1009f05e-9d85-4421-8601-4a5de0626f04
nsorokin-agent_craftext-train |
✓ Completed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-07 19:23 MSK | 40.0 h |
lm-mpi-job-0dfdc274-81d6-4c3f-aa99-a16cf5e42785
nsorokin-agent_craftext-train |
✓ Completed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-07 19:23 MSK | 39.8 h |
lm-mpi-job-a1b93be7-55ff-44c6-a2c8-e229afbdf42d
nsorokin-agent_craftext-train |
✓ Completed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-07 19:23 MSK | 39.9 h |
lm-mpi-job-7893d6d4-cffd-4f3b-b4a3-97037038f93e
nsorokin-agent_craftext-train |
✓ Completed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-07 19:23 MSK | 35.4 h |
lm-mpi-job-eef6c185-bf85-4f28-b9b0-edd5359ee93e
nsorokin-dev-2-gpu |
✓ Completed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-07 15:49 MSK | 5.5 h |
lm-mpi-job-0a506c19-c2e2-4ecd-b263-bd87f597ae18
alatyshev:eval_cosmos |
✗ Stopped | — | cpu.8C.32G | 2026-05-07 15:23 MSK | 51 min |
lm-mpi-job-27c8e824-606b-4a54-bab1-ce7861f0fd18
alatyshev:eval_cosmos_2_vneg10 |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-07 15:16 MSK | 58 min |
lm-mpi-job-ea0dc8b1-3b84-4b85-b0cb-59fad032157c
zvolovikova-craftext-minimal |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-06 17:25 MSK | 4.6 h |
lm-mpi-job-e5370c30-0b51-4ded-95ec-ebd042517765
zvolovikova-craftext-minimal |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-06 17:25 MSK | 3.8 h |
lm-mpi-job-d6ebb5a9-cd4e-4986-8818-022adfa4c39a
nsorokin-agent_crafter-oracle-balrog-base-cot-train |
✗ Failed | 4× | a100plus.4gpu.80vG.48C.976G | 2026-05-06 14:52 MSK | 12.7 h |
lm-mpi-job-e4e502a7-48b7-452b-96b7-0d8ad203b6d0
nsorokin-agent_crafter-balrog-min-cot-train |
✗ Failed | 2× | a100plus.2gpu.80vG.24C.488G | 2026-05-06 14:43 MSK | 5.0 h |
lm-mpi-job-e57d5800-e785-4884-a63a-43227cbbd03d
zvolovikova-craftext-minimal-gspo |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 14:58 MSK | 19.3 h |
lm-mpi-job-25102c01-72d6-498f-b2d2-187c8219a7ea
zvolovikova-craftext-minimal-gspo |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 14:57 MSK | 20.0 h |
lm-mpi-job-74554e98-4953-4f8e-a76a-5b7c1ea2350a
zvolovikova-craftext-minimal-gspo |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 14:56 MSK | 18.3 h |
lm-mpi-job-b96b58f8-ce2f-4672-b801-76948d30fad5
zvolovikova-craftext-minimal-gspo |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 14:01 MSK | 21.0 h |
lm-mpi-job-e57f27f7-0c2d-4681-82c2-38916935affa
zvolovikova-craftext-minimal-gspo |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 13:55 MSK | 6 min |
lm-mpi-job-94455db2-fa76-45ba-9887-756d7c69c6c0
zvolovikova-craftext-minimal-gspo |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 13:18 MSK | 6.7 h |
lm-mpi-job-df0d659c-d5f8-45b2-8e01-031587b226a2
zvolovikova-craftext-minimal-gspo |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 13:18 MSK | 1 min |
lm-mpi-job-d58d7897-7bd8-4b2f-a671-6a119023d962
zvolovikova-craftext-minimal-gspo |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 12:54 MSK | 16 min |
lm-mpi-job-fd771f2f-3240-4be9-b46f-20cbd719da37
zvolovikova-craftext-minimal-gspo |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 12:38 MSK | 13 min |
lm-mpi-job-77cde842-94c3-42f7-9522-a8467ed2fd12
zvolovikova-craftext-minimal-gspo |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 12:20 MSK | 12 min |
lm-mpi-job-f4503af0-5add-4a66-b68b-a2d4ca190ac3
zvolovikova-craftext-minimal-gspo |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 12:19 MSK | 1 min |
lm-mpi-job-4f945447-7ff3-46d4-9c24-58ae5ea83f36
zvolovikova-craftext-minimal-gspo |
✗ Failed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 12:15 MSK | 1 min |
lm-mpi-job-a9c9695d-3471-4c1f-b918-1816b0256bb5
phase2 cuda-lns new 777 |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.244G | 2026-05-05 11:31 MSK | 41.3 h |
lm-mpi-job-31071f3e-0f72-42cc-8466-5204d4e7b481
CogAI-ugadiarov.1gpu |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-05 09:57 MSK | 55.9 h |
lm-mpi-job-148a6f44-eb37-4deb-8ee1-25adbfc07f66
CogAI-ugadiarov.1gpu |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-05 08:49 MSK | 57.0 h |
lm-mpi-job-b4956565-3242-4e31-85a5-b68af60f9b90
CogAI-ugadiarov.1gpu |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-05 08:49 MSK | 1.2 h |
lm-mpi-job-d852cffd-7d55-4d9f-951e-5eaf3dc4eeba
CogAI-ugadiarov.1gpu |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-05 08:49 MSK | 57.0 h |
lm-mpi-job-b51ae7d5-aacd-4d75-9017-5f66988baa0f
apshenitsyn corl-mppi mlspace_sweep_o10 gate1 (sf_sweep_config_o10.yaml, env_configs6_o10) |
✓ Completed | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-05 01:57 MSK | 161.2 h |
lm-mpi-job-02535550-1d9a-49ef-9e3c-4fd3fc948ff3
apshenitsyn corl-mppi mlspace_sweep_o5_gate2 (sf_sweep_config_o5_15.yaml, env_configs6_o5) |
✗ Stopped | 1× | a100plus.1gpu.80vG.12C.96G | 2026-05-05 01:52 MSK | 86.9 h |