Difference between revisions of "Status"

From Freephile Wiki
(Created page with "Status is important. It might be a status report, or sometimes it's "network status" (are we up?) You can setup custom monitoring services and playbooks for failover and reco...")
 
 
(6 intermediate revisions by one other user not shown)
Status is important. It might be a status report, or sometimes it's "network status" (are we up?). You can set up custom monitoring services and playbooks for failover and recovery. However, sometimes a simple third-party service might suffice. A newcomer in the field is https://statuspage.io

Another project is Graphite <ref>https://graphite.readthedocs.io/en/latest/tools.html</ref>, which can be used with the built-in statsd collection in MediaWiki <ref>https://github.com/liuggio/statsd-php-client/</ref>.

In the cloud-native world, [https://prometheus.io/ Prometheus] is an active project. It's a time-series database and metrics monitoring solution, [https://prometheus.io/docs/instrumenting/exporters/ with many exporters], including haproxy, memcached, mysqld, etc.

{{Video|url=https://www.youtube.com/watch?v=uV_sh7_lVw8}} [https://www.slideshare.net/brianbrazil/prometheus-from-berlin-to-bonanza-cloudnativeconkubecon-europe-2017 slides for the video]

Prometheus is able to alert on any metric, so long as the data is available. This data comes from your [[wp:Instrumentation (computer programming)|code instrumentation]]. See the Prometheus docs for information on instrumenting your code: https://prometheus.io/docs/practices/instrumentation/

And for a great dashboard, there is [[Netdata]].

{{References}}

[[Category:System Administration]]
[[Category:Wiki]]
[[Category:System Architecture]]
[[Category:Performance]]
[[Category:SRE]]

Latest revision as of 15:42, 9 August 2023

Status is important. It might be a status report, or sometimes it's "network status" (are we up?). You can set up custom monitoring services and playbooks for failover and recovery. However, sometimes a simple third-party service might suffice. A newcomer in the field is https://statuspage.io

Another project is Graphite [1], which can be used with the built-in statsd collection in MediaWiki [2].
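As a rough illustration of what a statsd client puts on the wire, here is a minimal Python sketch. It assumes a StatsD daemon listening on the conventional UDP port 8125 on localhost and forwarding to Graphite; the metric names and both helper functions are hypothetical and are not part of MediaWiki or the PHP client referenced above.

```python
import socket


def format_statsd_metric(name: str, value: int, metric_type: str = "c") -> str:
    """Build one StatsD line: '<name>:<value>|<type>'.

    Common types are 'c' (counter), 'ms' (timer) and 'g' (gauge).
    """
    return f"{name}:{value}|{metric_type}"


def send_statsd_metric(line: str, host: str = "127.0.0.1", port: int = 8125) -> None:
    """Send one metric line over UDP; delivery is fire-and-forget."""
    with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
        sock.sendto(line.encode("ascii"), (host, port))


if __name__ == "__main__":
    # "wiki.page_views" is a hypothetical metric name used for illustration.
    send_statsd_metric(format_statsd_metric("wiki.page_views", 1))
```

Because the transport is UDP, the sender does not wait for a reply, so a slow or missing daemon should not hold up the application.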

In the cloud-native world, Prometheus is an active project. It's a time-series database and metrics monitoring solution, with many exporters, including haproxy, memcached, mysqld, etc.

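Video: https://www.youtube.com/watch?v=uV_sh7_lVw8 (slides for the video: https://www.slideshare.net/brianbrazil/prometheus-from-berlin-to-bonanza-cloudnativeconkubecon-europe-2017)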

Prometheus is able to alert on any metric, so long as the data is available. This data comes from your code instrumentation. See the Prometheus docs for information on instrumenting your code: https://prometheus.io/docs/practices/instrumentation/
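To make "instrumentation" concrete, the sketch below hand-rolls a counter and renders it in the Prometheus text exposition format that a scrape would read. It is a standard-library illustration only: the `Counter` class and the metric name `wiki_requests_total` are hypothetical, and a real application would more likely use an official client library (such as `prometheus_client` for Python) and serve the output over HTTP, by convention at `/metrics`.

```python
class Counter:
    """A minimal, monotonically increasing counter with optional labels."""

    def __init__(self, name: str, help_text: str) -> None:
        self.name = name
        self.help_text = help_text
        # Maps a sorted tuple of (label, value) pairs to the running total.
        self._values = {}

    def inc(self, amount: int = 1, **labels: str) -> None:
        """Increase the counter for the given label set."""
        if amount < 0:
            raise ValueError("counters can only increase")
        key = tuple(sorted(labels.items()))
        self._values[key] = self._values.get(key, 0) + amount

    def render(self) -> str:
        """Return the counter in Prometheus text exposition format."""
        lines = [
            f"# HELP {self.name} {self.help_text}",
            f"# TYPE {self.name} counter",
        ]
        for key, value in sorted(self._values.items()):
            if key:
                label_text = ",".join(f'{k}="{v}"' for k, v in key)
                lines.append(f"{self.name}{{{label_text}}} {value}")
            else:
                lines.append(f"{self.name} {value}")
        return "\n".join(lines) + "\n"


if __name__ == "__main__":
    requests_total = Counter("wiki_requests_total", "Total requests handled.")
    requests_total.inc(method="get")
    requests_total.inc(method="post")
    print(requests_total.render(), end="")
```

Once a metric like this is exposed and scraped, an alerting rule can be written against it, which is the sense in which any instrumented metric can be alerted on.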

And for a great dashboard, there is Netdata.

References
1. https://graphite.readthedocs.io/en/latest/tools.html
2. https://github.com/liuggio/statsd-php-client/