top of page

Simplify ALB Log Analysis with Serverless Architecture: A Developer's Guide

Aug 6, 2024

4 min read

1

33

0

Introduction


As developers and AWS administrators, we often find ourselves drowning in a sea of Application Load Balancer (ALB) logs, struggling to extract meaningful insights. What if there was a way to automate this process, providing you with regular, easy-to-understand reports? Today, I'm excited to introduce a project that does just that: a serverless application designed to help you gain visibility into your ALB logs through automated queries and visualisations.



The Problem: Navigating the ALB Log


ALB logs contain a wealth of information about your application's traffic, performance, and potential security issues. However, manually sifting through these logs is time-consuming and prone to oversight. Many teams struggle to:

  1. Regularly analyze ALB logs for patterns and anomalies

  2. Generate consistent reports for stakeholders

  3. Store historical data for trend analysis

  4. Quickly access and visualise log insights


The Solution: Serverless ALB Log Analysis


My project leverages AWS serverless technologies to create an automated, scalable solution for ALB log analysis. Here's what makes it special:

  • Automated Queries: Daily execution of predefined Athena queries against your ALB logs

  • Persistent Storage: Results stored in DynamoDB for quick retrieval and historical analysis

  • API Access: Easy retrieval of query results through API Gateway

  • Visualization: Simple web interface to view and interact with the data

  • Serverless Architecture: Utilising AWS Lambda, S3, DynamoDB, and API Gateway for a scalable, cost-effective solution


Solution Architecture


Let's break down the key components of our serverless application:

  1. S3 Bucket (AthenaQueryResultS3Bucket): Stores the results of our Athena queries

  2. API Gateway (albResultsAPI): Provides an HTTP endpoint to retrieve query results

  3. Lambda Function (albReports): Executes Athena queries and stores results in DynamoDB

  4. DynamoDB Table (QueryResultsTable): Stores query results for fast retrieval

  5. Lambda Function (RetrieveQueryResults): Fetches data from DynamoDB when called via API Gateway



The application works as follows:

  1. The albReports Lambda function runs daily, executing predefined Athena queries against your ALB logs

  2. Query results are stored in both S3 and DynamoDB

  3. The RetrieveQueryResults Lambda function can be called via API Gateway to fetch the latest results

  4. A simple web interface visualises the data retrieved from the API


Prerequisites


Before deploying this solution, ensure you have the following:

  1. An existing Athena Database

  2. An Athena Table with partitioning enabled for your ALB logs

  3. An S3 bucket that stores your ALB logs

  4. SAM CLI installed on your local machine

  5. AWS CLI and permissions to deploy the resources

These prerequisites ensure that you have the necessary infrastructure in place to query your ALB logs and deploy the serverless application.


Deployment Instructions


Ready to simplify your ALB log analysis? Follow these steps to deploy the application:

  1. Clone the repository to your local machine : https://github.com/RB-MRTZV/albLogLens.git

  2. Open the template.yaml file and make the following changes:

    1.       Replace Athena_DB_Name with your Athena database name

    2.       Update Athena_Table_Name to match your ALB logs table

    3.       Set Query_Period_In_Days to your desired query range.

    4.       Replace the S3 bucket ARN with your ALB logs bucket

    5.       Choose a unique name for the AthenaQueryResultS3Bucket

  3. Open a terminal and navigate to the project directory

  4. Run the following commands:


  • sam build --use-container

  • sam deploy


5. Monitor the deployment progress in your CLI and AWS CloudFormation console

6. Once deployed, manually run the albReports Lambda function to generate initial results


7. Replace your API Endpoint in the index.html

8 . Open the index.html file locally to view your ALB log insights



Enhancing Security with API Keys


To protect your API from unauthorised access and potential abuse, it's crucial to implement API key authentication. This section will guide you through the process of adding API key security to your deployment.


Why Use API Keys?


API keys offer several benefits:

  1. Access Control: Only clients with a valid API key can make requests

  2. Usage Tracking: Monitor and analyse API usage per key

  3. Throttling: Implement rate limiting to prevent abuse


Implementing API Key Security


Follow these steps to add API key protection to your ALB log analysis API:


Open the templatewithApiKey.yaml file in your project directory, and follow the instructions.


Build and deploy the template using SAM CLI:


  • sam build --use-container

  • sam deploy


After deployment, retrieve your API key from the AWS Console or using the AWS CLI:

aws apigateway get-api-keys --include-values

Updating the Frontend


To use the API key in your requests, you'll need to update the index.html file:

  1. Open index.html in your text editor.

  2. Locate the fetch function or wherever you're making API calls.

  3. Replace 'YOUR_API_KEY_HERE' with the actual API key you retrieved earlier.


Security Considerations


  • Keep your API key secure and don't expose it in public repositories.

  • For production use, consider implementing a more robust authentication system, such as OAuth or JWT.

  • Regularly rotate your API keys to minimise the impact of potential key exposure.

By implementing API key security, you've significantly enhanced the protection of your ALB log analysis solution. This ensures that only authorised users can access your valuable insights while also providing you with better control and monitoring capabilities.




Conclusion


With this serverless ALB log analysis solution, you can say goodbye to manual log parsing and hello to automated insights. By leveraging AWS services like Lambda, Athena, and DynamoDB, we've created a scalable, cost-effective way to gain visibility into your application's traffic patterns and performance.

Give it a try, and see how it can transform your approach to ALB log analysis. Have questions or suggestions? Feel free to contribute to the project on GitHub or reach out in the comments below.



Aug 6, 2024

4 min read

1

33

0

Comments

Share Your ThoughtsBe the first to write a comment.
bottom of page