Hi, I have a bunch of Raspberry Pies hosting all kinds of stuff and I want to have a monitoring solution for all of that. What would be your recommendations?
My goal is to be able to have an overview of CPU load, network load, CPU temp and to see what’s going on inside docker containers as I have everything dockerized. I’d like the solution to be open source. I want the solution to be web browser accessible and have nice load graphs with history. I don’t want to spend too much time setting it up.
All my Pies are running RaspberryOS, which is Debian based.
A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.
Rules:
Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.
No spam posting.
Posts have to be centered around self-hosting. There are other communities for discussing hardware or home computing. If it’s not obvious why your post topic revolves around selfhosting, please include details to make it clear.
Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.
Resources:
Any issues on the community? Report it using the report flag.
Questions? DM the mods!
Grafana, influxdb, telegraf agents. Easy to setup. Barely any configuration required. Everything you asked for in the default telegraf agent config. There are dashboards with plenty of examples on grafanas website.
What’s the difference between Prometheus and Telegraf? Why do you prefer Telegraf?
My work environments use Prometheus and node-exporter and grafana. At home I use telegraf, influxdb and grafana (and Prometheus for other app specific metrics) but the biggest reason I went with telegraf and influxdb at home is because Prometheus scrapes data from the configured clients (pull), while telegraf sends the data on the configured interval to influxdb (push) and starting my homelab adventure I had 2 VMS in the cloud and 2 pis at home and having telegraf sending the data in to my pis rather than going out and scraping made it a lot easier for that remote setup. I had influxdb setup behind a reverse proxy and auth so telegraf was sending data over TLS and needed to authenticate to just the single endpoint. That is the major difference to me, but there are also subsets of other exporters and plugins and stuff to tailor data for each one depending on what you want.
Ok, great to know, thanks!
Influxdb is a “time series” database for storing metrics. Temperatures, ram usage, cpu usage with time stamps. Telegraf is the client side agent that sends those metrics to the database in json format. Prometheus does pretty much the same thing but is a bit too bloated for my liking, so I went back to Influx.
prometheus is bloated?
Glances should be right up your alley.
For monitoring network devices (SNMP) I recommend LibreNMS. You can configure it to send data to other databases like Prometheus for centralization of all metrics.
https://www.librenms.org/
Second for Netdata for the temps and load info, portainer for docker monitoring. Netdata gives you more real time info than even glances. Portainer is an easy way to look at logs and such, I don’t use it to manage, prefer command line for that. Netdata we’ll give you some docker info, but not logs.
I use Ansible for management, I just want to see nice graphs and maybe get alerts when things go south. Thanks for recommendation.
Monit
I’m kind of loving Zabbix, but not sure if it’s the right solution for your needs. I’d say it would definitely work, but does take a bit of setup initially. This article is interesting, and seems to have a lot of what you want. Not sure if you want to do all of this. https://opensource.com/article/23/3/build-raspberry-pi-dashboard-appsmith
I use netdata, it’s quick and easy, but I don’t think it monitors docker containers specifically.
I’ll take a look, thanks!
I’m suggesting Homepage for someone for the second time.
https://github.com/gethomepage/homepage
I’ll take a look, thanks.
I don’t use Docker, so this may not help you, but I find bpytop and ssh works just fine. :)