Memory leaks
Your code might run now, but will it in the future?
Select your ecosystem
What is a memory leak?
Memory leaks are a common source of performance issues and instability in JavaScript applications. A memory leak occurs when a Node.js program fails to release memory that it no longer needs, causing the program to consume more and more memory over time. This can lead to poor performance, slow response times, and ultimately, cause the application and other applications to crash.
When an application does not need a memory block anymore, it should release it back to the OS. In the case of a memory leak, the garbage collector never collects the block and it stays on the heap.
There are several common causes of memory leaks in Node.js, such as global variables, multiple references, and incorrect use of closures and timers.
About this lesson
In this lesson, you will learn about vulnerabilities stemming from a memory leak and how to protect your applications against them. We will step into the shoes of a junior pentester, Jacob, who accidentally DoSed (Denial of Service) their client’s website when performing tests for a different type of vulnerability.
Jacob, a junior pentester for a cyber security firm in the south, is doing a pentest for SocialsCorp. SocialsCorp is a new social media platform where users have their own profile pages.
Jacob starts with trying to test the application for a common vulnerability type; IDOR (Insecure Direct Object Reference).
Let’s break down what happened in the story above. Jacob was sending requests to the user
endpoint with different user ID’s. After a handful of requests, the server got really slow and eventually crashed.
The backend code of the user
endpoint was not releasing memory after use, causing allocated memory to be never released and overall memory usage to climb. The backend code of the user
endpoint is the following:
The endpoint fetches the user from the database, then checks if the currently logged-in user is allowed to view that user, and if so, returns the user object.
There are two flaws here:
- The requested user is fetched from the database (if the
users
dictionary did not contain the user), regardless of the permissions that the authenticated user has. - The retrieved users are stored in a global variable that will keep growing larger as more users are fetched.
Each time a user is requested, the users
dictionary will grow. Jacob triggered an out of memory crash (a memory leak) by requesting a lot of different users in a short period of time. Because the user data can also contain profile pictures and other large data properties, the users
dictionary got to a point where it was consuming all memory.
What is the impact of a memory leak?
A memory leak in an application can have serious consequences for its performance and stability. If the process continues to consume more and more memory over time, it can eventually result in an out-of-memory error, causing the process to crash. Not only can the application that contains the leak crash but other applications might panic too, if they are not able to allocate new memory.
Additionally, memory leaks can lead to slower performance as the process spends more time managing and garbage collecting the increasing amount of memory it is using. This leads to decreased responsiveness and a poor user experience.
To migrate a memory leak in a Python application, you can follow these steps:
- Identify the source of the leak using tools such as the Python profiler module tracemalloc.
- Analyze the cause of the leak by reviewing the code, analyzing performance data, and reproducing the issue.
- Implement a fix to address the leak, which may involve refactoring code, optimizing data structures, or using different approaches to allocate and deallocate memory.
- Test and validate the fix by using tools from step 1, to verify that the leak has been fixed.
- Monitor and optimize your application's memory usage by performing regular performance testing, implementing best practices for memory management, and using tools like a heap snapshot analyzers to identify and fix any issues that may arise.
To migrate the memory leak in the users
endpoint code:
- Implement an eviction policy: To prevent the users object from growing indefinitely, you can implement a policy that removes the least recently used users from the object to make room for new ones.
- Use a cache library: Instead of manually implementing a cache, you could use a library that has built-in eviction policies and expiration times. This can help to automatically manage the
users
object and prevent it from growing too large. - Use a memory-efficient data structure: Instead of using an object to store the users, you could consider using a data structure that is more memory-efficient, such as a linked list or a queue. This will allow you to store a large number of users without consuming too much memory.
- Additionally, the objects stored in memory should only contain minimal information. Profile pictures or other large data should be referenced and not stored directly.
- Monitor memory usage: Regularly monitoring your application's memory usage can help you identify potential memory leaks and take steps to fix them before they become a problem. You can use the Python’s module tracemalloc or other debugger and heap snapshot analyzers to monitor memory usage and identify any issues.
- Had
socialcorp.com
monitored their memory usage, they would have spotted the memory leak themselves instead of stumbling upon it when it was too late. - Only store data that you really need. Perform the right checks, such as permission checks, first, and only then fetch the user object. Storing the user object in the
users
dictionary while you don’t even know you if you need it, is a waste of resources and memory space.
In Python, you can easily create an LRU (least recently used) cache using the functools.lru_cache
decorator. This is available in Python 3.2 and later.
Implementing the functools.lru_cache decorator into the users
endpoint code, and performing the permissions check first is very easy in Python. Just move the permissions check to the top of the get_user
function, import lru_cache from functools
, and finally, add @lru_cache(maxsize=100
) above the fetchUserFromDatabase
function.
That's it! As easy as that. The function calls will now automatically be cached, and the database is queried only once per unique user.
After these modifications, the code will looks something like this:
To migrate a memory leak in a web application, you can follow these steps:
- Identify the source of the leak using tools such as the debuggers, profilers and memory usage diagnostic tools.
- Analyze the cause of the leak by reviewing the code, analyzing performance data, and reproducing the issue.
- Implement a fix to address the leak, which may involve refactoring code, optimizing data structures, or using different approaches to allocate and deallocate memory.
- Test and validate the fix by using tools from step 1, to verify that the leak has been fixed.
- Monitor and optimize your application's memory usage by performing regular performance testing, implementing best practices for memory management, and using tools like the debugger and memory usage diagnostic tool to identify and fix any issues that may arise.
To mitigate the memory leak in the /user/
endpoint code, you can try these steps:
- Implement an eviction policy: To prevent the users object from growing indefinitely, you can implement a policy that removes the least recently used users from the object to make room for new ones.
- Use a caching library: Instead of manually implementing a cache, you could use a library that has built-in eviction policies and expiration times. This can help to automatically manage memory and prevent it from growing too large.
- Monitor memory usage: Regularly monitoring your application's memory usage can help you identify potential memory leaks and take steps to fix them before they become a problem. Most hosting providers have dashboards of the servers' memory usage. Had target.com monitored their memory usage, they would have spotted the memory leak themselves instead of stumbling upon it when it was too late.
- Only store data that you really need. Perform the right checks, such as permission checks, first, and only then fetch the user object. Storing the user object in the
users
variable while you don’t even know you need it is a waste of memory space.
C# .NET Core 7 and onwards has its own caching module, and it's very easy to use. Caching the responses from the /user/ endpoint
, and checking permissions before the DB lookup looks like this:
Keep learning
- OWASP has a page dedicated to memory leaks that can be found here: https://owasp.org/www-community/vulnerabilities/Memory_leak
- You can also read more about this CWE at https://cwe.mitre.org/data/definitions/401.html
- Check out our Snyk blog post about memory leaks and Python