As in, when I watch YouTube tutorials, I often see YouTubers with a small widget on their desktop giving them an overview of their RAM usage, security level, etc. What apps do you all use to track this?
I currently use the classic “Huh, seems slow”, check basic things like disk usage and process CPU/RAM usage, then “I’ll do a reboot to fix it for now”.
This is me. Can’t hurt to just do a reboot
Windows Server? :)
Checkmk (Raw - free version.) Some setup aspects are a bit annoying (wants to monitor every last ZFS dataset and takes too long to ‘ignore’ them one by one.) It does alert me to things that could cause issues, like the boot partition almost full. I run it in a Docker container on my (primarily) file server.
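For anyone curious what a "partition almost full" alert boils down to, here is a minimal sketch in plain Python. It is not how Checkmk implements its check; the 90% threshold and the function names are assumptions made up for illustration.

```python
import shutil


def disk_usage_percent(path: str) -> float:
    """Return the percentage of used space on the filesystem containing path."""
    usage = shutil.disk_usage(path)
    if usage.total == 0:
        # Some pseudo-filesystems report a total of 0; treat them as empty.
        return 0.0
    return usage.used / usage.total * 100


def is_almost_full(percent_used: float, threshold: float = 90.0) -> bool:
    """True when usage is at or above the warning threshold (90% is an assumption)."""
    return percent_used >= threshold


if __name__ == "__main__":
    # "/boot" is not a separate partition on every system; adjust the list as needed.
    for path in ("/", "/boot"):
        try:
            pct = disk_usage_percent(path)
        except OSError:
            print(f"{path}: not available")
            continue
        status = "WARN" if is_almost_full(pct) else "ok"
        print(f"{path}: {pct:.1f}% used [{status}]")
```

Run from cron, something like this would cover the boot partition case, though a real monitoring tool adds history, notifications and many more checks.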
Netdata, I’ve meant to look into Grafana but it always seemed way too overcomplicated and heavy for my purposes. Maybe one day, though…
I thought the same thing but it’s not bad actually, there are some prebuilt dashboards you can import for common metrics from Linux, Windows, firewalls, etc.
Netdata is much better though (IMHO)
I know that it needs a fix when my dad complains that he can’t watch TV and the rolling door doesn’t open in the morning.
I just check the Proxmox dashboard every now and then. Honestly if everything is working I’m not too worried about exact RAM levels at any given moment
Uptime Kuma and Grafana. Uptime Kuma to monitor if a service is up and running and Grafana to monitor the host like CPU, RAM, SSD usage etc.
Thank you for this. I appreciate the support.
Same here, I also have some autoscaling mechanisms set up in Docker Swarm to scale certain services in case the load is high
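As far as I know Swarm has no built-in autoscaler, so setups like this usually mean a small script that watches a metric and calls `docker service scale`. Below is a rough sketch of the decision part only. The proportional rule, the 60% CPU target, the replica limits and the service name `web` are all assumptions for illustration, not the commenter's actual setup.

```python
import math


def desired_replicas(current: int, cpu_percent: float,
                     target_percent: float = 60.0,
                     min_replicas: int = 1, max_replicas: int = 10) -> int:
    """Proportional rule: scale the replica count by observed load / target load.

    All defaults here are hypothetical; tune them for your own services.
    """
    wanted = math.ceil(max(current, 1) * cpu_percent / target_percent)
    # Clamp so a metrics glitch cannot scale to zero or to hundreds of replicas.
    return max(min_replicas, min(max_replicas, wanted))


def scale_command(service: str, replicas: int) -> list:
    """Build the `docker service scale` invocation without running it."""
    return ["docker", "service", "scale", f"{service}={replicas}"]


if __name__ == "__main__":
    # Example: 2 replicas averaging 90% CPU against a 60% target.
    replicas = desired_replicas(current=2, cpu_percent=90.0)
    print(" ".join(scale_command("web", replicas)))
```

The command is only printed here; passing the list to `subprocess.run` on a manager node would actually apply it.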
Netdata, monitoring a few thousand servers (virtual) that way.
Just to make sure: You are aware that a search option here exists, yes? And you keep refusing to use it for whatever reason?
I personally use InfluxDB, Telegraf and Grafana
Girlfriend first, Alert Manager second. Girlfriend is usually faster.
Uptime Kuma for my services; Netdata + Prometheus + Grafana for server health (alerts and visualization)
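If you only want the "is my service up" part and nothing else, the core idea is just a periodic connection attempt. Here is a minimal sketch of a TCP check in plain Python. It is not Uptime Kuma's implementation, and the service names and ports in the example are made up.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 3.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


if __name__ == "__main__":
    # Hypothetical services; replace with your own hosts and ports.
    services = {"web": ("127.0.0.1", 80), "ssh": ("127.0.0.1", 22)}
    for name, (host, port) in services.items():
        state = "up" if is_port_open(host, port, timeout=1.0) else "DOWN"
        print(f"{name} ({host}:{port}): {state}")
```

An open port only shows that something is listening, not that the app behind it is healthy, which is why dedicated tools also offer HTTP status and keyword checks.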
I use btop, I use arch btw
Nobody mentioned htop 🤔
htop is a selfhosted service?
Bashtop is pretty. But not scalable.
Btop
TICK stack is the only answer
I recommend Checkmk. https://checkmk.com/
I second CMK.
A TICK stack is unwieldy, Grafana takes a lot of setup, and all of this assumes you both know what to monitor and get stats on it.
CMK by contrast is plug and play. Install the server on a VM or host, install the agent on your other systems, and you’re good to go.
I’m running a TICK stack with a couple of thousand servers - way less CPU usage than Checkmk/Nagios or anything else from the previous millennium …
How do you solve the problem of runaway memory usage? Even monitoring a few dozen hosts, memory usage would grow to many GB and continue to grow indefinitely until it OOM’d, and from my reading Influx has no way to prevent this.
Have you had runaway memory problems with Influx, or your apps?
Specifically with Influx.