Netdata is distributed, real-time, performance and health monitoring for systems and applications. It is a highly optimized monitoring agent you install on all your systems and containers.lfx快充网络
Netdata provides unparalleled insights, in real-time, of everything happening on the systems it runs (including web servers, databases, applications), using highly interactive web dashboards. It can run autonomously, without any third party components, or it can be integrated to existing monitoring tool chains (Prometheus, Graphite, OpenTSDB, Kafka, Grafana, etc).lfx快充网络
Netdata is fast and efficient, designed to permanently run on all systems (physical & virtual servers, containers, IoTdevices), without disrupting their core function.lfx快充网络
Netdata is free, open-source software and it currently runs on Linux, FreeBSD, and MacOS.lfx快充网络
The following animated image, shows the top part of a typical netdata dashboard.lfx快充网络
lfx快充网络
A typical netdata dashboard, in 1:1 timing. Charts can be panned by dragging them, zoomed in/out with SHIFT + mouse wheel, an area can be selected for zoom-in with SHIFT + mouse selection. Netdata is highly interactive and real-time, optimized to get the work done!lfx快充网络
Netdata is used by hundreds of thousands of users all over the world. Check our GitHub watchers list. You will find people working for Amazon, Atos, Baidu, Cisco Systems, Citrix, Deutsche Telekom, DigitalOcean, Elastic, EPAM Systems, Ericsson, Google, Groupon, Hortonworks, HP, Huawei, IBM, Microsoft, NewRelic, Nvidia, Red Hat, SAP, Selectel, TicketMaster,Vimeo, and many more!lfx快充网络
Docker pulls
We provide docker images for the most common architectures. These are statistics reported by docker hub:lfx快充网络
lfx快充网络
Registry
When you install multiple netdata, they are integrated into one distributed application, via a netdata registry. This is a web browser feature and it allows us to count the number of unique users and unique netdata servers installed. The following information comes from the global public netdata registry we run:lfx快充网络
lfx快充网络
in the last 24 hours: lfx快充网络
Quick Start
You can quickly install netdata on a Linux box (physical, virtual, container, IoT) with the following command:lfx快充网络
# make sure you run `bash` for your shell
bash
# install netdata, directly from github sources
bash <(curl -Ss https://my-netdata.io/kickstart.sh)
lfx快充网络
The above command will:lfx快充网络
install any required packages on your system (it will ask you to confirm before doing so),
compile it, install it and start it
More installation methods and additional options can be found at the installation page.lfx快充网络
To try netdata in a docker container, run this:lfx快充网络
Release 1.14 contains 14 bug fixes and 24 improvements.lfx快充网络
The release introduces major additions to Kubernetes monitoring, with tens of new charts for Kubelet, kube-proxy and coredns metrics, as well as significant improvements to the netdata helm chart.lfx快充网络
Release 1.13.0 contains 14 bug fixes and 8 improvements.lfx快充网络
netdata has taken the first step into the world of Kubernetes, with a beta version of a Helm chart for deployment to a k8s cluster and proper naming of the cgroup containers. We have big plans for Kubernetes, so stay tuned!lfx快充网络
Patch release 1.12.2 contains 7 bug fixes and 4 improvements.lfx快充网络
The main motivation behind a new patch release is the introduction of a stable release channel. A "stable" installation and update channel was always on our roadmap, but it became a necessity when we realized that our users in China could not use the nightly releases published on Google Cloud. The "stable" channel is based on our official GitHub releases and uses assets hosted on GitHub.lfx快充网络
We are also introducing a new Oracle DB collector module, implemented in Python.lfx快充网络
Improved data collection plugins: netdata now supports monitoring network interface aliases, smartd_log, cpufreq, sensors.
Health monitoring improvements: network interface congestion alarm restored, alerta.io, conntrack_max.
my-netdatamenu has been refactored.
Packaging: openrc service definition got a few improvements.
Sep 18, 2018 - netdata has its own organizationlfx快充网络
Netdata used to be a firehol.org project, accessible as firehol/netdata.lfx快充网络
Netdata now has its own github organization netdata, so all github URLs are now netdata/netdata. The old github URLs, repo clones, forks, etc redirect automatically to the new repo.lfx快充网络
How it works
Netdata is a highly efficient, highly modular, metrics management engine. Its lockless design makes it ideal for concurrent operations on the metrics.lfx快充网络
lfx快充网络
This is how it works:lfx快充网络
Function
Description
Documentation
Collect
Multiple independent data collection workers are collecting metrics from their sources using the optimal protocol for each application and push the metrics to the database. Each data collection worker has lockless write access to the metrics it collects.
A lockless independent watchdog is evaluating health checks on the collected metrics, triggers alarms, maintains a health transaction log and dispatches alarm notifications.
The result is a highly efficient, low latency system, supporting multiple readers and one writer on each metric.lfx快充网络
Infographic
This is a high level overview of netdata feature set and architecture. Click it to to interact with it (it has direct links to documentation).lfx快充网络
lfx快充网络
Features
lfx快充网络
This is what you should expect from Netdata:lfx快充网络
General
1s granularity - the highest possible resolution for all metrics.
Unlimited metrics - collects all the available metrics, the more the better.
1% CPU utilization of a single core - it is super fast, unbelievably optimized.
A few MB of RAM - by default it uses 25MB RAM. You size it.
Zero disk I/O - while it runs, it does not load or save anything (except error and access logs).
Zero configuration - auto-detects everything, it can collect up to 10000 metrics per server out of the box.
Zero maintenance - You just run it, it does the rest.
Zero dependencies - it is even its own web server, for its static web files and its web API (though its plugins may require additional libraries, depending on the applications monitored).
Scales to infinity - you can install it on all your servers, containers, VMs and IoTs. Metrics are not centralized by default, so there is no limit.
Several operating modes - Autonomous host monitoring (the default), headless data collector, forwarding proxy, store and forward proxy, central multi-host monitoring, in all possible configurations. Each node may have different metrics retention policy and run with or without health monitoring.
Health Monitoring & Alarms
Sophisticated alerting - comes with hundreds of alarms, out of the box! Supports dynamic thresholds, hysteresis, alarm templates, multiple role-based notification methods.
Integrations
time-series dbs - can archive its metrics to graphite, opentsdb, prometheus, json document DBs, in the same or lower resolution (lower: to prevent it from congesting these servers due to the amount of data collected).
Visualization
Stunning interactive dashboards - mouse, touchpad and touch-screen friendly in 2 themes: slate (dark) and white.
Amazingly fast visualization - responds to all queries in less than 1 ms per metric, even on low-end hardware.
Visual anomaly detection - the dashboards are optimized for detecting anomalies visually.
Customizable - custom dashboards can be built using simple HTML (no javascript necessary).
Positive and negative values
To improve clarity on charts, netdata dashboards present positive values for metrics representing read, input, inbound, received and negative values for metrics representing write, output, outbound, sent.lfx快充网络
lfx快充网络
Netdata charts showing the bandwidth and packets of a network interface. received is positive and sent is negative.lfx快充网络
Autoscaled y-axis
Netdata charts automatically zoom vertically, to visualize the variation of each metric within the visible time-frame.lfx快充网络
lfx快充网络
A zero based stacked chart, automatically switches to an auto-scaled area chart when a single dimension is selected.lfx快充网络
Charts are synchronized
Charts on netdata dashboards are synchronized to each other. There is no master chart. Any chart can be panned or zoomed at any time, and all other charts will follow.lfx快充网络
lfx快充网络
Charts are panned by dragging them with the mouse. Charts can be zoomed in/out withSHIFT + mouse wheel while the mouse pointer is over a chart.lfx快充网络
The visible time-frame (pan and zoom) is propagated from netdata server to netdata server, when navigating via the my-netdata menu.lfx快充网络
Highlighted time-frame
To improve visual anomaly detection across charts, the user can highlight a time-frame (by pressing ALT + mouse selection) on all charts.lfx快充网络
lfx快充网络
A highlighted time-frame can be given by pressing ALT + mouse selection on any chart. Netdata will highlight the same range on all charts.lfx快充网络
Highlighted ranges are propagated from netdata server to netdata server, when navigating via the my-netdata menu.lfx快充网络
What does it monitor
Netdata data collection is extensible - you can monitor anything you can get a metric for. Its Plugin API supports all programing languages (anything can be a netdata plugin, BASH, python, perl, node.js, java, Go, ruby, etc).lfx快充网络
For better performance, most system related plugins (cpu, memory, disks, filesystems, networking, etc) have been written in C.
For faster development and easier contributions, most application related plugins (databases, web servers, etc) have been written in python.
APM (Application Performance Monitoring)
statsd - netdata is a fully featured statsd server.
Go expvar - collects metrics exposed by applications written in the Go programming language using the expvar package.
Spring Boot - monitors running Java Spring Boot applications that expose their metrics with the use of the Spring Boot Actuator included in Spring Boot library.
uWSGI - collects performance metrics from uWSGI applications.
Network Stack - everything about the networking stack (both IPv4 and IPv6 for all protocols: TCP, UDP, SCTP, UDPLite, ICMP, Multicast, Broadcast, etc), and all network interfaces (per interface: bandwidth, packets, errors, drops).
Netfilter - everything about the netfilter connection tracker.
SynProxy - collects performance data about the linux SYNPROXY (DDoS).
Applications - by grouping the process tree and reporting CPU, memory, disk reads, disk writes, swap, threads, pipes, sockets - per process group.
systemd - monitors systemd services using CGROUPS.
Users
Users and User Groups resource usage - by summarizing the process tree per user and group, reporting: CPU, memory, disk reads, disk writes, swap, threads, pipes, sockets
logind - collects sessions, users and seats connected.
Containers and VMs
Containers - collects resource usage for all kinds of containers, using CGROUPS (systemd-nspawn, lxc, lxd, docker, kubernetes, etc).
libvirt VMs - collects resource usage for all kinds of VMs, using CGROUPS.
Squid - multiple servers, each showing: clients bandwidth and requests, servers bandwidth and requests.
Traefik - connects to multiple traefik instances (local or remote) to collect API metrics (response status code, response time, average response time and server uptime).
Varnish - threads, sessions, hits, objects, backends, etc.
IPVS - collects metrics from the Linux IPVS load balancer.
Database Servers
CouchDB - reads/writes, request methods, status codes, tasks, replication, per-db, etc.
MemCached - multiple servers, each showing: bandwidth, connections, items, etc.
MongoDB - operations, clients, transactions, cursors, connections, asserts, locks, etc.
MySQL and mariadb - multiple servers, each showing: bandwidth, queries/s, handlers, locks, issues, tmp operations, connections, binlog metrics, threads, innodb metrics, and more.
PostgreSQL - multiple servers, each showing: per database statistics (connections, tuples read - written - returned, transactions, locks), backend processes, indexes, tables, write ahead, background writer and more.
ISC Bind (named) - multiple servers, each showing: clients, requests, queries, updates, failures and several per view metrics. All versions of bind after 9.9.10 are supported.
NSD - queries, zones, protocols, query types, transfers, etc.
FreeRadius - uses the radclient command to provide freeradius statistics (authentication, accounting, proxy-authentication, proxy-accounting).
Telephony Servers
opensips - connects to an opensips server (localhost only) to collect real-time performance metrics.
Household Appliances
SMA webbox - connects to multiple remote SMA webboxes to collect real-time performance metrics of the photovoltaic (solar) power generation.
Fronius - connects to multiple remote Fronius Symo servers to collect real-time performance metrics of the photovoltaic (solar) power generation.
StiebelEltron - collects the temperatures and other metrics from your Stiebel Eltron heating system using their Internet Service Gateway (ISG web).
Game Servers
SpigotMC - monitors Spigot Minecraft server ticks per second and number of online players using the Minecraft remote console.
Distributed Computing
BOINC - monitors task states for local and remote BOINC client software using the remote GUI RPC interface. Also provides alarms for a handful of error conditions.
Media Streaming Servers
IceCast - collects the number of listeners for active sources.
Monitoring Systems
Monit - collects metrics about monit targets (filesystems, applications, networks).
Provisioning Systems
Puppet - connects to multiple Puppet Server and Puppet DB instances (local or remote) to collect real-time status metrics.
You can easily extend Netdata, by writing plugins that collect data from any source, using any computer language.lfx快充网络
Documentation
The netdata documentation is at https://docs.netdata.cloud. But you can also find it inside the repo, so by just navigating the repo on github you can find all the documentation.lfx快充网络