无法在 AWS EC2 实例上分配可用内存（甚至一半！）

Question 1

看起来有人vm.overcommit_memory在新图像中将值设置为 2。

https://www.kernel.org/doc/Documentation/vm/overcommit-accounting：

2   -   Don't overcommit. The total address space commit
        for the system is not permitted to exceed swap + a
        configurable amount (default is 50%) of physical RAM.
        Depending on the amount you use, in most situations
        this means a process will not be killed while accessing
        pages but will receive errors on memory allocation as
        appropriate.

要解决该问题 - 启用 vm.overcommit_memory（将其设置为 0），或调整 vm.overcommit_ratio，或进行 30Gb 交换。

我真的不知道如何解决这些奇怪的问题，但我可能会做以下事情：

阅读所有与内存管理相关的内核文档。
比较vm.*两台服务器上的 sysctl 参数。
检查 dmesg 消息中是否存在硬件/系统错误。
使用调试信息构建内核，附加调试器，在 mmap 系统调用附近的某处设置断点并查看发生了什么。

Answer

看起来有人vm.overcommit_memory在新图像中将值设置为 2。

https://www.kernel.org/doc/Documentation/vm/overcommit-accounting：

2   -   Don't overcommit. The total address space commit
        for the system is not permitted to exceed swap + a
        configurable amount (default is 50%) of physical RAM.
        Depending on the amount you use, in most situations
        this means a process will not be killed while accessing
        pages but will receive errors on memory allocation as
        appropriate.

要解决该问题 - 启用 vm.overcommit_memory（将其设置为 0），或调整 vm.overcommit_ratio，或进行 30Gb 交换。

我真的不知道如何解决这些奇怪的问题，但我可能会做以下事情：

阅读所有与内存管理相关的内核文档。
比较vm.*两台服务器上的 sysctl 参数。
检查 dmesg 消息中是否存在硬件/系统错误。
使用调试信息构建内核，附加调试器，在 mmap 系统调用附近的某处设置断点并查看发生了什么。

Question 2

另一个可能的原因是 Linux 内核的值vm.max_map_count限制了您的应用。它设置了进程可以拥有的 mmap 数量的最大值，这可能会导致应用出现堆分配错误，例如：

fatal error: out of memory allocating heap arena metadata

使用以下方法读取当前值：

sudo sysctl vm.max_map_count

使用以下方法更新值：

# Double the value
sudo sysctl -w vm.max_map_count=131072

# Apply now during runtime
sudo sysctl -p

Answer

另一个可能的原因是 Linux 内核的值vm.max_map_count限制了您的应用。它设置了进程可以拥有的 mmap 数量的最大值，这可能会导致应用出现堆分配错误，例如：

fatal error: out of memory allocating heap arena metadata

使用以下方法读取当前值：

sudo sysctl vm.max_map_count

使用以下方法更新值：

# Double the value
sudo sysctl -w vm.max_map_count=131072

# Apply now during runtime
sudo sysctl -p

无法在 AWS EC2 实例上分配可用内存（甚至一半！）

答案1

答案2

相关内容