Since more than a week, each day my mini server has a oom and all containers are getting killed by oom. How do I find the reason?
Bonus: how can they automatically restart again. I have to do it manually.
Edit: I did not change my system that ran for a while without problems.
OOMs happen because your system is out of memory.
You asked how to know which process is responsible. There is no correct answer to which process is “wrong” in using more memory — all one can say is that too many pro. The kernel tries to “blame” a process and will kill it, as you’ve seen, to let your system continue to function, but ultimately, you may know better than it which is acting in a way you don’t want. It tends to go after
It should log something to the kernel log when it OOM kills something.
It may be that you simply don’t have enough memory to do what you want to do. You could take a glance in
top
(sort by memory usage with shift-M). You might be able to get by by adding more paging (swap) space. You can do this with a paging file if it’s problematic to create a paging partition.