Comments (5)
NB: just when I opened this issue, the analysis job started to make progress. So please ignore the first paragraph, while the second still holds.
from ocaml-ci.
Hello @hannesm !
A public status page doesn't currently exist, though for significant outages we do post on the infra blog.
from ocaml-ci.
Thanks for your comment @rikusilvola. Since yesterday afternoon, there's again first an outage, and now temporary failures.
I'm still wondering what is the Service Level that you intend to deliver? What are "significant outages" that are getting posted to the "infra blog"?
from ocaml-ci.
Indeed, several minor outages were experienced for OCaml-CI in the past few days. With increased load, the service became unresponsive but was recovered within a couple of hours each time. Initial investigations point to lwt starvation leading to the web interface getting stuck.
The services are provided with best-effort support, meaning that once an issue is noticed, it is treated during business hours according to its relative criticality. Most of the time, what is perceived as an outage is a reduced quality of service due to a temporary spike in activity. These outages are commonly transient, and the service is restored without human intervention.
Here are some examples of posts for significant outages
- http://infra.ocaml.org/2023/06/15/opam-repo-ci-ocaml-ci.html
- http://infra.ocaml.org/2023/05/05/opam-repo-ci.html
- http://infra.ocaml.org/2023/04/06/maintenance-operations.html
I welcome you to report any outage you experience on ocaml/infrastructure.
from ocaml-ci.
Thanks for your reply. What I understand (please correct me if I'm wrong) that "during office hours [unclear where], the service is maintained as we see fits [with some priority]". There's no SLA, human intervention is required for restarting / restoring the service when there is a spike in activity.
Most of the time, what is perceived as an outage is a reduced quality of service
You mean the 500 - internal server error - I get at the moment are "reduced quality of service"?
In any case, thanks for providing the free service. I'll close my issues and hope you'll eventually find time and energy to setup monitoring and more reliability.
from ocaml-ci.
Related Issues (20)
- Job waiting on ocluster resource marked as "Build failed" on github
- "experimental" "variants" HOT 5
- customized compiler distro? HOT 5
- lint-fmt fails on package using dune's "site" feature HOT 2
- `lint-doc` uses lower-bounds for installing dependencies HOT 1
- CI failure HOT 3
- UX HOT 5
- EXPERIMENTAL: (lint-opam) (failure: No *.opam files found!)
- Web UI reports "Oops! Internal Server Error" when job hasn't started yet
- Missing `.ocamlformat` version line causes analysis failure HOT 1
- failed: Failed to get head of ocaml/opam-repository:refs/heads/master HOT 1
- CI failure on 5.1 32-bit builders HOT 1
- OCaml 5.2 support HOT 2
- Add additional opam-repository / Dockerfile customization? HOT 3
- Naming of builders
- FreeBSD CI failures HOT 8
- debian-12 x86_32 workers fail due to old capnproto version HOT 5
- `lint-doc` job failing on installation of `conf-git` HOT 1
- `lint-fmt` job fails with "Error: Program ocamlformat not found in the tree or in PATH" HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from ocaml-ci.