Netdata

From Freephile Wiki

Revision as of 14:14, 10 July 2018

Netdata will be part of the QualityBox dashboard.

See http://wiki.freephile.org:20000/

System Locations

   - the daemon     at /usr/sbin/netdata
   - config files   in /etc/netdata
   - web files      in /usr/share/netdata
   - plugins        in /usr/libexec/netdata
   - cache files    in /var/cache/netdata
   - db files       in /var/lib/netdata
   - log files      in /var/log/netdata
   - pid file       at /var/run/netdata.pid
   - logrotate file at /etc/logrotate.d/netdata

Host Modifications

Check KSM (kernel memory deduper)

Memory de-duplication instructions

A kernel memory de-duper (Kernel Same-page Merging, or KSM) is available on this host, but it is not currently enabled.

To enable it run:

   echo 1 >/sys/kernel/mm/ksm/run
   echo 1000 >/sys/kernel/mm/ksm/sleep_millisecs

If you enable it, you will save 40-60% of netdata memory. Values written to /sys this way are lost on reboot unless they are reapplied at boot.
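Before changing anything, it can help to check the current KSM state. The following is a minimal sketch, not part of Netdata: `ksm_status` is a hypothetical helper name, and it assumes the standard sysfs layout in which the `run` file contains 1 while KSM is active.

```shell
# Report whether KSM is enabled, disabled, or not available.
# The optional first argument is the KSM control directory; it defaults to
# the real sysfs path and is only a parameter so the helper can be tried
# against a scratch directory.
ksm_status() {
    ksm_dir="${1:-/sys/kernel/mm/ksm}"
    if [ ! -r "$ksm_dir/run" ]; then
        echo "unavailable"
        return 0
    fi
    case "$(cat "$ksm_dir/run")" in
        1) echo "enabled" ;;   # 1 means the KSM daemon is running
        *) echo "disabled" ;;  # 0 (stopped) or 2 (stopped and unmerged)
    esac
}
```

Calling `ksm_status` with no arguments inspects the live system.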


Ports

By default, Netdata listens on all IP addresses on port 19999. We add a firewalld rule to allow port 20000, and the haproxy config passes that port through to the Netdata backend.

 http://this.machine.ip:20000/ => http://127.0.0.1:19999
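The firewalld side of this can be sketched as follows. This is an illustration rather than the exact commands used on this host: the helper name `firewalld_open_port_cmds` is hypothetical, and the function only prints the commands so they can be reviewed before anything is changed. The haproxy side is not shown.

```shell
# Print (rather than run) the firewalld commands that open a TCP port
# permanently and then reload the firewall so the rule takes effect.
firewalld_open_port_cmds() {
    port="$1"
    echo "firewall-cmd --permanent --add-port=${port}/tcp"
    echo "firewall-cmd --reload"
}

# To apply after reviewing the output (requires root):
#   firewalld_open_port_cmds 20000 | sudo sh
```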

Start/Stop

To stop netdata run: systemctl stop netdata

To start netdata run: systemctl start netdata

Installation extras

By default, the configuration sends alarm messages to 'root', so either change the recipient in the config (sudo vim /etc/netdata/health_alarm_notify.conf) or forward root's mail by editing the aliases file (vim /etc/aliases && newaliases).
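For the aliases route, the edit can also be scripted. The sketch below is one possible approach, not something Netdata provides: `set_root_alias` is a hypothetical helper and admin@example.com is a placeholder address. It replaces any existing `root:` entry in the given file and leaves other aliases untouched.

```shell
# Point the "root" mail alias at a new recipient.
# $1: path to the aliases file, $2: recipient address.
set_root_alias() {
    aliases_file="$1"
    recipient="$2"
    tmp_file="$(mktemp)"
    # Keep every line except an existing "root:" entry. grep exits non-zero
    # when it prints nothing, which is not an error here.
    grep -v '^root:' "$aliases_file" > "$tmp_file" || true
    echo "root: $recipient" >> "$tmp_file"
    # Write back in place so the file keeps its owner and permissions.
    cat "$tmp_file" > "$aliases_file"
    rm -f "$tmp_file"
}

# As root, followed by rebuilding the alias database:
#   set_root_alias /etc/aliases admin@example.com && newaliases
```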

You'll probably receive alarms for 'tcp listen drops'. This is likely bot-related, and there is a good discussion of how to identify the source of the problem and how to mitigate or resolve it at https://github.com/firehol/netdata/issues/3234
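To see the raw counter yourself, the kernel exposes a ListenDrops value in /proc/net/netstat. The sketch below assumes that this counter is the one behind the alarm and that the file has the usual layout of a `TcpExt:` header line followed by a `TcpExt:` value line; `listen_drops` is a hypothetical helper name.

```shell
# Print the cumulative TCP ListenDrops counter.
# The optional first argument is the file to parse (defaults to the live
# kernel statistics).
listen_drops() {
    netstat_file="${1:-/proc/net/netstat}"
    awk '
        # First TcpExt line: column names. Remember where ListenDrops is.
        $1 == "TcpExt:" && !have_header {
            for (i = 2; i <= NF; i++) if ($i == "ListenDrops") col = i
            have_header = 1
            next
        }
        # Second TcpExt line: values. Print the matching column.
        $1 == "TcpExt:" && col { print $col; exit }
    ' "$netstat_file"
}
```

Running `listen_drops` twice a few minutes apart shows whether drops are still accumulating.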

Following the advice from NASA at https://wiki.earthdata.nasa.gov/display/HDD/SOMAXCONN, I increased the somaxconn kernel parameter from 128 to 1024:

 cat /proc/sys/net/core/somaxconn
 128
 sysctl -w net.core.somaxconn=1024
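Note that sysctl -w only changes the running kernel, so the value reverts on reboot. One common way to persist it is a drop-in file under /etc/sysctl.d. The sketch below assumes that approach; the file name 90-somaxconn.conf and the helper name `persist_somaxconn` are hypothetical.

```shell
# Write a sysctl drop-in that sets net.core.somaxconn at boot.
# $1: desired value, $2: optional drop-in directory (defaults to /etc/sysctl.d).
persist_somaxconn() {
    value="$1"
    conf_dir="${2:-/etc/sysctl.d}"
    printf 'net.core.somaxconn = %s\n' "$value" > "$conf_dir/90-somaxconn.conf"
}

# As root, write the file and load all sysctl configuration immediately:
#   persist_somaxconn 1024 && sysctl --system
```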

Figure: TCP state diagram


Updates

Netdata updates itself, and it puts an updater script into cron:

 ln -s /root/netdata/netdata-updater.sh /etc/cron.daily/netdata-updater
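If updates seem to have stopped, a quick check is whether that symlink still exists and still points at a real file, since moving or deleting /root/netdata would leave it dangling. A minimal sketch, with `updater_link_ok` as a hypothetical helper name:

```shell
# Succeed only if the cron entry is a symlink whose target still exists.
# The optional first argument is the link path to check.
updater_link_ok() {
    link="${1:-/etc/cron.daily/netdata-updater}"
    # -L: the path is a symlink; -e: following the link reaches a real file.
    [ -L "$link" ] && [ -e "$link" ]
}

# Example:
#   updater_link_ok && echo "updater is linked" || echo "updater link is missing or broken"
```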