Incidents | Cerebrium Incidents reported on status page for Cerebrium https://status.cerebrium.ai/ en US EAST 1 recovered https://status.cerebrium.ai/ Wed, 21 Jan 2026 14:39:39 +0000 https://status.cerebrium.ai/#8d5d182e50cdbdda59117176e5b6fe33abe67278cbb4e27b816a70d7c58a46aa US EAST 1 recovered US EAST 1 went down https://status.cerebrium.ai/ Wed, 21 Jan 2026 14:35:40 +0000 https://status.cerebrium.ai/#8d5d182e50cdbdda59117176e5b6fe33abe67278cbb4e27b816a70d7c58a46aa US EAST 1 went down US EAST 1 recovered https://status.cerebrium.ai/ Wed, 21 Jan 2026 14:32:58 +0000 https://status.cerebrium.ai/#08b94741a48d28ca182f1666d895730df8da04f428fa4f998696b0235951e536 US EAST 1 recovered US EAST 1 went down https://status.cerebrium.ai/ Wed, 21 Jan 2026 14:29:15 +0000 https://status.cerebrium.ai/#08b94741a48d28ca182f1666d895730df8da04f428fa4f998696b0235951e536 US EAST 1 went down US EAST 1 recovered https://status.cerebrium.ai/ Wed, 21 Jan 2026 14:20:10 +0000 https://status.cerebrium.ai/#aedeb8a13a0ce65eade12b2691fcf28dcea590f0557b97a9efde0be41adf8f4f US EAST 1 recovered US EAST 1 went down https://status.cerebrium.ai/ Wed, 21 Jan 2026 14:17:17 +0000 https://status.cerebrium.ai/#aedeb8a13a0ce65eade12b2691fcf28dcea590f0557b97a9efde0be41adf8f4f US EAST 1 went down Metrics London recovered https://status.cerebrium.ai/ Wed, 21 Jan 2026 13:29:19 +0000 https://status.cerebrium.ai/#602394198ff96e09367dafb0bdb194d70eb66bf0efb2b9b7e51372fdf7e7d0b8 Metrics London recovered Metrics London went down https://status.cerebrium.ai/ Wed, 21 Jan 2026 13:21:11 +0000 https://status.cerebrium.ai/#602394198ff96e09367dafb0bdb194d70eb66bf0efb2b9b7e51372fdf7e7d0b8 Metrics London went down Metrics Virginia recovered https://status.cerebrium.ai/ Wed, 21 Jan 2026 09:55:06 +0000 https://status.cerebrium.ai/#d8e43bc9e2f9555097f82fe246fbb396395cd65a07f26eb20dc42ddc32d2c5ed Metrics Virginia recovered Metrics Virginia went down https://status.cerebrium.ai/ Wed, 21 Jan 2026 09:52:57 +0000 
https://status.cerebrium.ai/#d8e43bc9e2f9555097f82fe246fbb396395cd65a07f26eb20dc42ddc32d2c5ed Metrics Virginia went down Files US EAST 1 recovered https://status.cerebrium.ai/ Wed, 21 Jan 2026 00:24:58 +0000 https://status.cerebrium.ai/#2e5c50f89bfad1eebdc7f9ba04a18915617400c1ecee59951ac28102c87fbce5 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Wed, 21 Jan 2026 00:23:46 +0000 https://status.cerebrium.ai/#2e5c50f89bfad1eebdc7f9ba04a18915617400c1ecee59951ac28102c87fbce5 Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Tue, 20 Jan 2026 13:06:09 +0000 https://status.cerebrium.ai/#9c8d6d9230d2967ab20f049c0503749eb23f48905a96cb5d056abd18b799b8be Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Tue, 20 Jan 2026 13:04:01 +0000 https://status.cerebrium.ai/#9c8d6d9230d2967ab20f049c0503749eb23f48905a96cb5d056abd18b799b8be Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Tue, 20 Jan 2026 03:52:54 +0000 https://status.cerebrium.ai/#a3e973bcd66a72d826ba8f7e27aae1ebcaa6b16fae6d37427473d3f924b9aaaa Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Tue, 20 Jan 2026 03:51:04 +0000 https://status.cerebrium.ai/#a3e973bcd66a72d826ba8f7e27aae1ebcaa6b16fae6d37427473d3f924b9aaaa Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Mon, 19 Jan 2026 15:55:02 +0000 https://status.cerebrium.ai/#390cb23ecdad0b190a47ae0950691d3999b354305363eb10131464d81e4ae861 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Mon, 19 Jan 2026 15:53:49 +0000 https://status.cerebrium.ai/#390cb23ecdad0b190a47ae0950691d3999b354305363eb10131464d81e4ae861 Files US EAST 1 went down Metrics Virginia recovered https://status.cerebrium.ai/ Mon, 19 Jan 2026 10:13:09 +0000 https://status.cerebrium.ai/#d338e33ab751da2661955d819fb93322631f430b7804861f9db3d49c0459b71e Metrics Virginia recovered Metrics 
Virginia went down https://status.cerebrium.ai/ Mon, 19 Jan 2026 09:32:00 +0000 https://status.cerebrium.ai/#d338e33ab751da2661955d819fb93322631f430b7804861f9db3d49c0459b71e Metrics Virginia went down US EAST 1 recovered https://status.cerebrium.ai/ Fri, 16 Jan 2026 09:55:11 +0000 https://status.cerebrium.ai/#cc98650f358992b04bfe11e50aa09c5c66d0c5681e29b667ce36caada3312e09 US EAST 1 recovered US EAST 1 went down https://status.cerebrium.ai/ Fri, 16 Jan 2026 09:52:48 +0000 https://status.cerebrium.ai/#cc98650f358992b04bfe11e50aa09c5c66d0c5681e29b667ce36caada3312e09 US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Thu, 15 Jan 2026 21:57:55 +0000 https://status.cerebrium.ai/#2b00a94005438c5bd845a61c419456fa471e36486105c911657d23067191724c Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Thu, 15 Jan 2026 21:56:48 +0000 https://status.cerebrium.ai/#2b00a94005438c5bd845a61c419456fa471e36486105c911657d23067191724c Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Thu, 15 Jan 2026 19:21:58 +0000 https://status.cerebrium.ai/#6d40a220f7033138aae508b8a557f672ac19d1417fa507de0ccc13baa350ce05 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Thu, 15 Jan 2026 19:20:48 +0000 https://status.cerebrium.ai/#6d40a220f7033138aae508b8a557f672ac19d1417fa507de0ccc13baa350ce05 Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Thu, 15 Jan 2026 19:15:59 +0000 https://status.cerebrium.ai/#2d4a9c274ede5d09199ec07aa07cca7a27b8b8c5855ad0e78c66171a75fbaed3 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Thu, 15 Jan 2026 19:14:48 +0000 https://status.cerebrium.ai/#2d4a9c274ede5d09199ec07aa07cca7a27b8b8c5855ad0e78c66171a75fbaed3 Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Thu, 15 Jan 2026 16:35:58 +0000 
https://status.cerebrium.ai/#4076b90f97c1b75b15c2a70bd8a138c49435805b7b8f1aafc4ab6b398b3916a2 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Thu, 15 Jan 2026 16:33:02 +0000 https://status.cerebrium.ai/#4076b90f97c1b75b15c2a70bd8a138c49435805b7b8f1aafc4ab6b398b3916a2 Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Wed, 14 Jan 2026 11:32:54 +0000 https://status.cerebrium.ai/#31720168246f81bd1ccbb23e20e1bce3a384d77746bdfc63b8d10b0a641ecdae Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Wed, 14 Jan 2026 11:30:53 +0000 https://status.cerebrium.ai/#31720168246f81bd1ccbb23e20e1bce3a384d77746bdfc63b8d10b0a641ecdae Files US EAST 1 went down Increase in request queuing on AWS workloads https://status.cerebrium.ai/incident/802912 Mon, 12 Jan 2026 09:00:00 -0000 https://status.cerebrium.ai/incident/802912#7ab384d4aba5d5dcf8222a210d787a36433e8c2ff3171bf5f85060e21b8cd863 We're currently experiencing degraded performance on workloads being scheduled to the AWS provider. This issue currently only affects GPU-based workloads. This issue is intermittent and may not be affecting all apps. The team is currently investigating the issue and we will provide an update as we uncover any new information. 
Files US EAST 1 recovered https://status.cerebrium.ai/ Mon, 05 Jan 2026 12:46:55 +0000 https://status.cerebrium.ai/#7dd5520e75c68c7026417c3deaa39a31b6893379ded71206fd858db12024443a Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Mon, 05 Jan 2026 12:45:50 +0000 https://status.cerebrium.ai/#7dd5520e75c68c7026417c3deaa39a31b6893379ded71206fd858db12024443a Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Sun, 28 Dec 2025 23:24:04 +0000 https://status.cerebrium.ai/#514a53364b978f303095f79d7c1507aa32e11eaa00800343f4ec9f2b334ccf9b Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Sun, 28 Dec 2025 23:22:52 +0000 https://status.cerebrium.ai/#514a53364b978f303095f79d7c1507aa32e11eaa00800343f4ec9f2b334ccf9b Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Thu, 25 Dec 2025 12:19:00 +0000 https://status.cerebrium.ai/#90c274fb25cb7c1883e9724b858f3a723376b7492fbcad35b0f74fe601ee1608 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Thu, 25 Dec 2025 12:17:51 +0000 https://status.cerebrium.ai/#90c274fb25cb7c1883e9724b858f3a723376b7492fbcad35b0f74fe601ee1608 Files US EAST 1 went down Metrics Virginia recovered https://status.cerebrium.ai/ Tue, 23 Dec 2025 07:56:05 +0000 https://status.cerebrium.ai/#2cb487da41b9d353639197c64a7c1e4f03b9e86b8b17dfc0ea4265838a9c9f41 Metrics Virginia recovered Metrics Virginia went down https://status.cerebrium.ai/ Tue, 23 Dec 2025 07:54:03 +0000 https://status.cerebrium.ai/#2cb487da41b9d353639197c64a7c1e4f03b9e86b8b17dfc0ea4265838a9c9f41 Metrics Virginia went down Metrics Virginia recovered https://status.cerebrium.ai/ Sat, 20 Dec 2025 15:47:15 +0000 https://status.cerebrium.ai/#774b60b004dcb32f00da683ea7f2dd5a2e36bc4d780b0d423acd7546b3db98cb Metrics Virginia recovered Metrics Virginia went down https://status.cerebrium.ai/ Sat, 20 Dec 2025 15:46:01 +0000 
https://status.cerebrium.ai/#774b60b004dcb32f00da683ea7f2dd5a2e36bc4d780b0d423acd7546b3db98cb Metrics Virginia went down Files US EAST 1 recovered https://status.cerebrium.ai/ Fri, 12 Dec 2025 19:40:56 +0000 https://status.cerebrium.ai/#1be099dc0be1929e724639e0311e56e3e34d13d6374ec43cce6a0c2195f30f51 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Fri, 12 Dec 2025 19:39:53 +0000 https://status.cerebrium.ai/#1be099dc0be1929e724639e0311e56e3e34d13d6374ec43cce6a0c2195f30f51 Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Fri, 12 Dec 2025 07:11:52 +0000 https://status.cerebrium.ai/#d6c2563559e36f1a94a7cf90f074423652527b38cc136392f7a5656b60b438c7 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Fri, 12 Dec 2025 07:10:48 +0000 https://status.cerebrium.ai/#d6c2563559e36f1a94a7cf90f074423652527b38cc136392f7a5656b60b438c7 Files US EAST 1 went down Build Service recovered https://status.cerebrium.ai/ Thu, 11 Dec 2025 17:20:07 +0000 https://status.cerebrium.ai/#79c1c5abec96183e1b13900968d05ce0a2b29536c872f28935c1cf84c8af46b6 Build Service recovered Build Service went down https://status.cerebrium.ai/ Thu, 11 Dec 2025 15:00:50 +0000 https://status.cerebrium.ai/#79c1c5abec96183e1b13900968d05ce0a2b29536c872f28935c1cf84c8af46b6 Build Service went down Build Service recovered https://status.cerebrium.ai/ Wed, 10 Dec 2025 15:52:08 +0000 https://status.cerebrium.ai/#0fb06ca99667e548486cf3d060859b954829dd6171f7ccf37b8b71c246e984fc Build Service recovered Build Service went down https://status.cerebrium.ai/ Wed, 10 Dec 2025 14:18:53 +0000 https://status.cerebrium.ai/#0fb06ca99667e548486cf3d060859b954829dd6171f7ccf37b8b71c246e984fc Build Service went down Problem starting new workloads. Existing apps are unaffected. 
https://status.cerebrium.ai/incident/783164 Tue, 09 Dec 2025 19:08:00 -0000 https://status.cerebrium.ai/incident/783164#3a148ec3d4aca0a7568662b6de72a63d3307b7004630d8db4f39a2d78be6ec4c The issue has been resolved. Problem starting new workloads. Existing apps are unaffected. https://status.cerebrium.ai/incident/783164 Tue, 09 Dec 2025 18:44:00 -0000 https://status.cerebrium.ai/incident/783164#830358f004199aa5af28e313f89f76798f7c9008f45ffd0d748217510683a6ce New apps are unable to start at present. Files US EAST 1 recovered https://status.cerebrium.ai/ Mon, 08 Dec 2025 11:04:31 +0000 https://status.cerebrium.ai/#871f118934ca537491d4800292e9f5d494e7f947d2275ed02188cba5fa668c3b Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Mon, 08 Dec 2025 11:02:32 +0000 https://status.cerebrium.ai/#871f118934ca537491d4800292e9f5d494e7f947d2275ed02188cba5fa668c3b Files US EAST 1 went down Files US EAST 1 recovered https://status.cerebrium.ai/ Wed, 03 Dec 2025 05:51:38 +0000 https://status.cerebrium.ai/#c628ec99bff4d41c47c7f7a9bed3eac0cbc77dc175f02d564f627f507bb06c4b Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Wed, 03 Dec 2025 05:49:37 +0000 https://status.cerebrium.ai/#c628ec99bff4d41c47c7f7a9bed3eac0cbc77dc175f02d564f627f507bb06c4b Files US EAST 1 went down Elevated Errors in US-East-1 https://status.cerebrium.ai/incident/778505 Tue, 02 Dec 2025 23:54:00 -0000 https://status.cerebrium.ai/incident/778505#9a6a4a594b4a98c27a6518f481d7a24a1c5d001b1b7369a32cd3ff823a3829aa Our platform is currently struggling to schedule new containers on incoming requests. Our team is working on identifying the error and resolving it ASAP. Resolved: The issue was caused by a failure in a managed component from one of our infrastructure providers, which temporarily prevented us from scheduling new capacity. We’ve worked with the provider to restore functionality and are now implementing additional safeguards to ensure this does not recur.
Build Service recovered https://status.cerebrium.ai/ Mon, 01 Dec 2025 22:42:24 +0000 https://status.cerebrium.ai/#32cd06e23998e6c5a7a92d093299f5093707fc74c66df9ac065b908e46578ae6 Build Service recovered Build Service went down https://status.cerebrium.ai/ Mon, 01 Dec 2025 20:52:24 +0000 https://status.cerebrium.ai/#32cd06e23998e6c5a7a92d093299f5093707fc74c66df9ac065b908e46578ae6 Build Service went down Build Service recovered https://status.cerebrium.ai/ Wed, 26 Nov 2025 05:06:19 +0000 https://status.cerebrium.ai/#fa17132d836d581a360de22b1b2be135cd90a18211d1d89813ccd43f115dbe6c Build Service recovered Build Service went down https://status.cerebrium.ai/ Wed, 26 Nov 2025 03:48:17 +0000 https://status.cerebrium.ai/#fa17132d836d581a360de22b1b2be135cd90a18211d1d89813ccd43f115dbe6c Build Service went down Files US EAST 1 recovered https://status.cerebrium.ai/ Mon, 24 Nov 2025 03:41:58 +0000 https://status.cerebrium.ai/#c4c12b95e84849d8a33578b12fe4913562918f523c26cd2666bf8f58163cb401 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Mon, 24 Nov 2025 03:35:17 +0000 https://status.cerebrium.ai/#c4c12b95e84849d8a33578b12fe4913562918f523c26cd2666bf8f58163cb401 Files US EAST 1 went down Metrics Virginia recovered https://status.cerebrium.ai/ Thu, 20 Nov 2025 15:48:48 +0000 https://status.cerebrium.ai/#68f8fafa958330f69e5d0194a5e1c21da3c2a3275c77e2b8a7aec726bc068eb0 Metrics Virginia recovered Metrics Virginia went down https://status.cerebrium.ai/ Thu, 20 Nov 2025 15:46:47 +0000 https://status.cerebrium.ai/#68f8fafa958330f69e5d0194a5e1c21da3c2a3275c77e2b8a7aec726bc068eb0 Metrics Virginia went down Files US EAST 1 recovered https://status.cerebrium.ai/ Thu, 20 Nov 2025 13:40:47 +0000 https://status.cerebrium.ai/#31a158e90c025e4e430ca0a378505d474ef29c23c9f7fd83baa851ddf1870d98 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Thu, 20 Nov 2025 13:39:46 +0000 
https://status.cerebrium.ai/#31a158e90c025e4e430ca0a378505d474ef29c23c9f7fd83baa851ddf1870d98 Files US EAST 1 went down Files EU WEST 2 recovered https://status.cerebrium.ai/ Wed, 19 Nov 2025 20:02:08 +0000 https://status.cerebrium.ai/#c43fb14d65911de5134c7409030eb6e200614a1f5dd7c9a92788efd845c75682 Files EU WEST 2 recovered Files EU WEST 2 went down https://status.cerebrium.ai/ Wed, 19 Nov 2025 20:01:07 +0000 https://status.cerebrium.ai/#c43fb14d65911de5134c7409030eb6e200614a1f5dd7c9a92788efd845c75682 Files EU WEST 2 went down Build Service recovered https://status.cerebrium.ai/ Tue, 18 Nov 2025 21:41:59 +0000 https://status.cerebrium.ai/#786f3d1ece57187d2bc4f63b40d14feaae6459f9ca4835acb5ff4c592b805ddf Build Service recovered Build Service went down https://status.cerebrium.ai/ Tue, 18 Nov 2025 20:35:59 +0000 https://status.cerebrium.ai/#786f3d1ece57187d2bc4f63b40d14feaae6459f9ca4835acb5ff4c592b805ddf Build Service went down Build Service recovered https://status.cerebrium.ai/ Tue, 18 Nov 2025 01:16:50 +0000 https://status.cerebrium.ai/#0c6a6735024cf0be111bfea916a5324e485e067d0788ebb8d47837a56a389d08 Build Service recovered Build Service went down https://status.cerebrium.ai/ Tue, 18 Nov 2025 00:12:51 +0000 https://status.cerebrium.ai/#0c6a6735024cf0be111bfea916a5324e485e067d0788ebb8d47837a56a389d08 Build Service went down Metrics Virginia recovered https://status.cerebrium.ai/ Mon, 17 Nov 2025 00:41:27 +0000 https://status.cerebrium.ai/#d640dea1fa1c32f0251906cecc06fb6cab82d60f81dc54c1197e8f762da4a495 Metrics Virginia recovered Metrics Virginia went down https://status.cerebrium.ai/ Mon, 17 Nov 2025 00:39:26 +0000 https://status.cerebrium.ai/#d640dea1fa1c32f0251906cecc06fb6cab82d60f81dc54c1197e8f762da4a495 Metrics Virginia went down Updating various cluster components https://status.cerebrium.ai/incident/765784 Sun, 16 Nov 2025 16:10:08 -0000 https://status.cerebrium.ai/incident/765784#f0f4af9806ae2db765672f802fe14ce584f8859c47d6ec9d7295cf618a4fa6a1 Maintenance 
completed Updating various cluster components https://status.cerebrium.ai/incident/765784 Sun, 16 Nov 2025 15:35:00 -0000 https://status.cerebrium.ai/incident/765784#44c162f4b64153670bac6f17c25bfa4e676dc9f436b6a01c2f3a84cc52e0defd We are performing a series of infrastructure optimizations to improve performance and reliability. While we don’t expect customer traffic to be impacted, there may be brief periods of elevated latency or volatility during the upgrade window. Our team is closely monitoring the rollout and will update this page with any relevant changes.
Files US EAST 1 recovered https://status.cerebrium.ai/ Thu, 13 Nov 2025 18:30:50 +0000 https://status.cerebrium.ai/#de3737269e01d2808921404705134115a8405abcb300a44ef5e572157e1485ef Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Thu, 13 Nov 2025 17:44:33 +0000 https://status.cerebrium.ai/#de3737269e01d2808921404705134115a8405abcb300a44ef5e572157e1485ef Files US EAST 1 went down Metrics Virginia recovered https://status.cerebrium.ai/ Thu, 13 Nov 2025 12:20:21 +0000 https://status.cerebrium.ai/#8600cd2fc128cc7f5a7d073e4a00ae22acbe7729c68ee0b9b46db353ced4ba25 Metrics Virginia recovered Metrics Virginia went down https://status.cerebrium.ai/ Thu, 13 Nov 2025 12:04:20 +0000 https://status.cerebrium.ai/#8600cd2fc128cc7f5a7d073e4a00ae22acbe7729c68ee0b9b46db353ced4ba25 Metrics Virginia went down Metrics Virginia recovered https://status.cerebrium.ai/ Thu, 13 Nov 2025 12:03:20 +0000 https://status.cerebrium.ai/#2121d52903efd924ecec86ad375fcf84fe483969b3552d0566a1cc772c2f4b9f Metrics Virginia recovered Metrics Virginia went down https://status.cerebrium.ai/ Thu, 13 Nov 2025 11:57:20 +0000 https://status.cerebrium.ai/#2121d52903efd924ecec86ad375fcf84fe483969b3552d0566a1cc772c2f4b9f Metrics Virginia went down Metrics Virginia recovered https://status.cerebrium.ai/ Mon, 10 Nov 2025 20:13:02 +0000 https://status.cerebrium.ai/#ee11b170a8b4ef24d837b9903ee8eb4031793c6647d7b2a54a1eb1d91dd38fcf Metrics Virginia recovered Metrics Virginia went down https://status.cerebrium.ai/ Mon, 10 Nov 2025 20:09:02 +0000 https://status.cerebrium.ai/#ee11b170a8b4ef24d837b9903ee8eb4031793c6647d7b2a54a1eb1d91dd38fcf Metrics Virginia went down Files US EAST 1 recovered https://status.cerebrium.ai/ Sun, 09 Nov 2025 03:16:38 +0000 https://status.cerebrium.ai/#6ad883ffe0d0cba8843f4c9e9e0cbd51760d6a05112e33f555b9cc1949fffd84 Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Sun, 09 Nov 2025 03:14:38 +0000 
https://status.cerebrium.ai/#6ad883ffe0d0cba8843f4c9e9e0cbd51760d6a05112e33f555b9cc1949fffd84 Files US EAST 1 went down Metrics Virginia recovered https://status.cerebrium.ai/ Fri, 07 Nov 2025 14:43:21 +0000 https://status.cerebrium.ai/#27101d35fb3c6474c57afc19389c42105536a96117e76191cf654caab238e3c0 Metrics Virginia recovered Files US EAST 1 recovered https://status.cerebrium.ai/ Fri, 07 Nov 2025 14:42:13 +0000 https://status.cerebrium.ai/#b4405b26165d0223598da93aad73a4ab52fa685bbf25f775257e85ad175e4bae Files US EAST 1 recovered Metrics Virginia went down https://status.cerebrium.ai/ Fri, 07 Nov 2025 14:41:22 +0000 https://status.cerebrium.ai/#27101d35fb3c6474c57afc19389c42105536a96117e76191cf654caab238e3c0 Metrics Virginia went down Files US EAST 1 went down https://status.cerebrium.ai/ Fri, 07 Nov 2025 14:41:11 +0000 https://status.cerebrium.ai/#b4405b26165d0223598da93aad73a4ab52fa685bbf25f775257e85ad175e4bae Files US EAST 1 went down Emergency node maintenance in US-East-1 https://status.cerebrium.ai/incident/757186 Tue, 04 Nov 2025 04:27:01 -0000 https://status.cerebrium.ai/incident/757186#ae296967f69932c9ad3c7b13a49966bcdbd91fe0028b42d0a9a790585ae79ed6 Maintenance completed Emergency node maintenance in US-East-1 https://status.cerebrium.ai/incident/757186 Tue, 04 Nov 2025 04:00:34 -0000 https://status.cerebrium.ai/incident/757186#d47ae91f32582e55a5a2dcc9e6bc40e24a2191052cb85532b3e4de37ecdcefe7 A critical error in the mechanism GPU devices use to attach to containers is affecting several workloads on the platform, causing NVML to show "Device not found" when calling nvidia-smi or attempting to use the GPU (mentioned in
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/latest/troubleshooting.html#containers-losing-access-to-gpus-with-error-failed-to-initialize-nvml-unknown-error). This maintenance will update all GPU nodes to use CDI (Container Device Interface) and apply a few container runtime upgrades.
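The symptom described in this maintenance notice can be checked for from inside a container. The following is a minimal sketch, not Cerebrium's own tooling: it assumes `nvidia-smi` is on the PATH, and the function names and the two failure-marker strings (taken from the notice text and from the anchor of the linked NVIDIA troubleshooting page) are illustrative assumptions.

```python
import shutil
import subprocess

# Substrings drawn from the incident text and the linked NVIDIA troubleshooting
# anchor; treating them as failure markers is an assumption of this sketch.
NVML_FAILURE_MARKERS = ("device not found", "failed to initialize nvml")


def looks_like_lost_gpu(output: str) -> bool:
    """Return True if tool output contains one of the assumed failure markers."""
    lowered = output.lower()
    return any(marker in lowered for marker in NVML_FAILURE_MARKERS)


def gpu_visible(timeout_s: float = 10.0) -> bool:
    """Run `nvidia-smi -L` and report whether a GPU still appears usable."""
    if shutil.which("nvidia-smi") is None:
        return False  # no NVIDIA tooling available in this environment
    try:
        result = subprocess.run(
            ["nvidia-smi", "-L"],  # -L lists the GPUs the driver can see
            capture_output=True,
            text=True,
            timeout=timeout_s,
            check=False,
        )
    except (subprocess.TimeoutExpired, OSError):
        return False
    combined = result.stdout + result.stderr
    return result.returncode == 0 and not looks_like_lost_gpu(combined)
```

A workload could call `gpu_visible()` at startup or from a health check and exit early when it returns False, so that the request is retried on another container rather than failing mid-inference.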
Files US EAST 1 recovered https://status.cerebrium.ai/ Tue, 28 Oct 2025 22:23:35 +0000 https://status.cerebrium.ai/#eea1856546de77ae01493e30aec6a89fd62a0a907b49e252343b6208d43943ca Files US EAST 1 recovered Files US EAST 1 went down https://status.cerebrium.ai/ Tue, 28 Oct 2025 22:22:34 +0000 https://status.cerebrium.ai/#eea1856546de77ae01493e30aec6a89fd62a0a907b49e252343b6208d43943ca Files US EAST 1 went down Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 23:36:00 -0000
https://status.cerebrium.ai/incident/746816#def6d05d3ec66619875cc72f480e59b5e4fc16b651f34fefe86783f281a574ad Resolved Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 19:17:00 -0000 https://status.cerebrium.ai/incident/746816#c2796bb6bcf9aeb2ead994d0816196a44f82c71df6e0c874a4db8826faec0b59 We continue to observe recovery across all AWS services, and instance
launches are succeeding across multiple Availability Zones in the US-EAST-1 Region. Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 18:24:00 -0000 https://status.cerebrium.ai/incident/746816#08caa7295d5150a3985a744fff62124415b3fe892f24c413ded26bc73486cbcb AWS's mitigations to resolve launch failures for new EC2 instances continue to progress and we are seeing increased launches of new EC2 instances.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 17:48:00 -0000 https://status.cerebrium.ai/incident/746816#63d5cc546455aa540f4c11553e1ee571569501a82cc40bf8190db9cf776ad430 AWS has resolved launch failures and is rolling out the changes to all AZs, at which point we expect launch errors and network connectivity issues to subside.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 17:04:00 -0000 https://status.cerebrium.ai/incident/746816#0344617f774b7af62a0b35ad079fe58cd65549059c279b494e519764a530a924 AWS is in the process of validating a fix for EC2 launches and will deploy to the first AZ as soon as they have confidence they can do so safely.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 15:47:00 -0000 https://status.cerebrium.ai/incident/746816#869f54ebbeff72d23a7f83e3ee9b40b543149323c691df5acef5f39efa5e3be7 AWS have narrowed down the source of the network connectivity issues that have impacted their services. They are throttling requests for new EC2 instance launches to aid recovery and actively working on mitigations.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 14:01:00 -0000 https://status.cerebrium.ai/incident/746816#55398d34ca052b66a663b8a6fafb6229c9d65baf791efe2ae6318d5cc992ecff AWS has applied fixes but is still experiencing problems launching instances in us-east-1. Builds and endpoint calls remain broken. We'll keep you posted.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 13:28:00 -0000 https://status.cerebrium.ai/incident/746816#67599f05df32c991402005470e8eaf57294cf54e0e8c0e1a09a50c5bef88da37 The AWS outage is ongoing. Builds are currently broken due to an outage with EC2. We're waiting on AWS to resolve the issue and will keep you updated.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 11:10:00 -0000 https://status.cerebrium.ai/incident/746816#a49214b9a48be3601aad264a1fdf6dc91ff8867170cd7b4c97618fc61a65bc16 All services have now been restored fully. We will continue to monitor for any anomalies. Thank you for your patience and we apologise for the inconvenience.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 09:43:00 -0000 https://status.cerebrium.ai/incident/746816#15bc346de987b0c270ff70ae21f1a5339045ee3e609949ccc47943dbc02a18d0 Most services have now recovered. You may still experience issues building apps on Cerebrium while AWS continues to resolve the remaining problems. We'll update you once everything is back to normal.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 09:31:00 -0000 https://status.cerebrium.ai/incident/746816#f2755af8f9d9beb9133349e36c2cb6dd9b14b1d56cc67d2fc1b92ca5cee1077f AWS has applied a fix and some services are starting to recover. You may still see some errors or slower response times as things fully stabilize. If something fails, please try again. We'll keep you posted as more services are restored.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 09:01:00 -0000 https://status.cerebrium.ai/incident/746816#7f5682cdb78d1f389b7f350a2e1e75fd236a69267522cbc2fbf643b51989e0ad AWS has identified the root cause as a DNS resolution issue affecting DynamoDB and other services in US-EAST-1. They're working on multiple recovery paths to accelerate the fix. Cerebrium services remain impacted during this time. If you encounter errors, please continue to retry your requests. AWS will provide their next update by 2:45 AM.
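The advice above to keep retrying failed requests can be sketched as a retry loop with exponential backoff and jitter. This is only an illustration: `retry_with_backoff`, its parameters, and the `call` argument are hypothetical names, not part of any Cerebrium or AWS API, and the delay values are assumptions rather than recommended settings.

```python
import random
import time


def retry_with_backoff(call, max_attempts=5, base_delay=0.5, max_delay=8.0,
                       sleep=time.sleep, rng=random.random):
    """Run `call()` until it succeeds or `max_attempts` is reached.

    Before retry n the wait is a random value in
    [0, min(max_delay, base_delay * 2**n)] ("full jitter"), so that
    many clients recovering from the same outage do not retry in lockstep.
    """
    last_error = None
    for attempt in range(max_attempts):
        try:
            return call()
        except Exception as exc:  # assumption: real code would catch only transient errors
            last_error = exc
            if attempt == max_attempts - 1:
                break  # no attempts left, give up
            cap = min(max_delay, base_delay * (2 ** attempt))
            sleep(rng() * cap)
    raise last_error
```

A caller would wrap its own request function, for example `retry_with_backoff(lambda: send_request(payload))`, where `send_request` stands in for whatever client code issues the endpoint call.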
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 08:29:00 -0000 https://status.cerebrium.ai/incident/746816#871cd301c7bebebef8f179e43876babbef14a6d7fbd37f53a467067d6240c74e The AWS team have narrowed down the critically affected services; however, these services are core to the Cerebrium platform, and your dashboards, builds, and endpoint calls are still affected. We are continuing to investigate and will provide more updates within the next 45 minutes.
Elevated upstream errors (us-east-1) https://status.cerebrium.ai/incident/746816 Mon, 20 Oct 2025 07:38:00 -0000 https://status.cerebrium.ai/incident/746816#5a509bc68dcfde22169faca0750514fa7e5c34b578ec1b50a44df545757ed329 We are seeing elevated error rates from upstream AWS errors across the majority of our services in the us-east-1 region. We will share an update as soon as possible.
Degraded Inference API in US-EAST-1 https://status.cerebrium.ai/incident/740083 Wed, 08 Oct 2025 18:28:00 -0000 https://status.cerebrium.ai/incident/740083#89d8d4e8dd689746d3c782842aa817ffbafe52467e5adfe5607a9365aceac920 The Inference API is currently experiencing degraded performance in US-EAST-1. Our team is working on a fix ASAP.
Inference API https://status.cerebrium.ai/incident/737024 Fri, 03 Oct 2025 13:13:00 -0000 https://status.cerebrium.ai/incident/737024#e1c9c6e4c4fdf5e170832cdabbc8311af2f5a5ebda3688b6218f48be2e12c17e The Inference API is currently experiencing a high 502 failure rate. Roughly 45% of all requests are affected. Our team is investigating the cause as a matter of high urgency.
Container Count is down https://status.cerebrium.ai/incident/726877 Thu, 18 Sep 2025 23:05:00 -0000 https://status.cerebrium.ai/incident/726877#02b232bc3e6fa758f3a2ce6d5b1043c6c6f3e30c2573ee0fa7cd895a0e38bb5f A third-party provider is down, affecting the container count on the dashboard.