Deploying Async LLM-Chat with Message History

1. Redis Channel Layer

This is a Django web application with a ReactJS frontend that presents an LLM chatbot built on OpenAI's API.

The chat also has a memory layer, allowing the user to hold longer, consistent conversations with the chatbot.

The main building blocks are Django Channels on the backend and a client-side WebSocket listener in React, the key components for implementing asynchronous messaging.

This application works well on localhost, but if we want to deploy it, we need to change the channel layer from in-memory to Redis, as the Channels documentation suggests.
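For reference, this is what the in-memory channel layer looks like in config/settings.py before the change; a minimal sketch of the standard Channels setting:

# config/settings.py (before): the in-memory layer, fine for localhost only

CHANNEL_LAYERS = {
    "default": {
        "BACKEND": "channels.layers.InMemoryChannelLayer",
    },
}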

The GitHub repository is available here.

Change the In-memory Channel Layer to Redis

As the first step, we need to install channels_redis:

pip install redis channels_redis

In order to use Redis for the channel layer, we need to modify the code in several places. First of all, we need to update the Django file that contains the class for handling the chat.

# chat/chat_api.py

from openai import OpenAI
from environs import Env
import redis  # Redis Channel Layer
import json  # Redis Channel Layer

# Load the environment variables
env = Env()
env.read_env()

client = OpenAI(api_key=env.str("OPENAI_API_KEY"))

class AiChat():

    _role = "You are helpful and friendly. Be as short and concise as you can!"
    
    # Redis Channel Layer
    _redis_client = redis.Redis(host=env.str('REDISHOST', default='redis'), port=6379, db=0)  # same REDISHOST fallback as settings.py

    def __init__(self, prompt: str, model: str, channel: str) -> None:
        self.prompt = prompt
        self.model = model
        self.channel = channel

        ## Redis Channel Layer
        
        # Seed data for a brand-new conversation
        initial_data = [{"role": "system", "content": self._role}]

        # Check if the channel already exists in Redis; if not, store the seed
        if not self._redis_client.exists(channel):
            self._redis_client.set(channel, json.dumps(initial_data))

        # Retrieve the conversation from Redis
        conversation_data = self._redis_client.get(channel)
        self.conversation = json.loads(conversation_data) if conversation_data else initial_data


    def chat(self) -> str:
        if self.prompt:
            # The conversation is going on ...
            # Adding prompt to chat history
            self.conversation.append({"role": "user", "content": self.prompt})
            # Redis Channel Layer
            self._redis_client.set(self.channel, json.dumps(self.conversation))
            # OpenAI's chat completion endpoint generates the answer to the prompt
            completion = client.chat.completions.create(
                model=self.model,
                messages=self.conversation
            )
            answer = completion.choices[0].message.content
            # Adding answer to chat history
            self.conversation.append({"role": "assistant", "content": answer})
            # Redis Channel Layer
            self._redis_client.set(self.channel, json.dumps(self.conversation))
            return answer
        # No prompt was given, so there is nothing to send to the model
        return ""

The places where I modified the original code are marked with the # Redis Channel Layer comment.
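For context, here is a minimal sketch of how a Channels consumer could drive AiChat for each WebSocket connection; the module name, consumer class, message shape, and model name are illustrative assumptions, not code from the repository:

# chat/consumers.py (hypothetical sketch; names and message shape are assumptions)

import json

from channels.generic.websocket import WebsocketConsumer

from .chat_api import AiChat


class ChatConsumer(WebsocketConsumer):
    def connect(self):
        self.accept()

    def receive(self, text_data=None, bytes_data=None):
        prompt = json.loads(text_data).get("message", "")
        # Keying the history by channel_name gives each connection its own memory
        ai = AiChat(prompt=prompt, model="gpt-4o-mini", channel=self.channel_name)
        self.send(text_data=json.dumps({"message": ai.chat()}))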

Next, we need to change the channel layer in ./config/settings.py from in-memory to the Redis channel layer and point it at the Redis host and port.

# config/settings.py

from environs import Env

# Load the environment variables
env = Env()
env.read_env()

# ...

# Channels

CHANNEL_LAYERS = {
    "default": {
        "BACKEND": "channels_redis.core.RedisChannelLayer",
        "CONFIG": {
            "hosts": [(env.str('REDISHOST', default="redis"), 6379)],
        },
    },
}

The REDISHOST environment variable is provided by the cloud platform, while "redis" is the default for local development, as you will see in the next section.
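To verify that the application can actually reach Redis, a quick sanity check with redis-py helps; in my sketch the host is "redis" inside the Docker network from the next section, or "localhost" when the port is mapped to the host machine:

# Quick connectivity check; adjust the host to where you run it

import redis

r = redis.Redis(host="redis", port=6379, db=0)
print(r.ping())  # True if the Redis service is reachable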

2. Developing

To avoid a local Redis installation, I run Redis as a service in a docker-compose.yml file, with a Dockerfile for the backend container image.

The local virtual environment and some Python artifacts are not needed in the container, so a .dockerignore file would be useful here. Note, however, that the bind mount used for hot reloading in docker-compose.yml overrides it, and hot reloading is more convenient during development.
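A minimal .dockerignore might look like this; the entries are assumptions based on a typical Django + React layout, not the repository's actual file:

# .dockerignore (example entries, adjust to the real repo layout)
.env
venv/
__pycache__/
*.pyc
frontend/node_modules/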

Before building the Docker image, I update the requirements.txt file with the latest installations.

pip freeze > requirements.txt

Dockerfile

# Pull base image
FROM python:3.11-slim-bullseye

# Set environment variables
ENV PIP_DISABLE_PIP_VERSION_CHECK=1
ENV PYTHONDONTWRITEBYTECODE=1
ENV PYTHONUNBUFFERED=1

# Set work directory
WORKDIR /app

# Install dependencies
COPY ./requirements.txt .
RUN pip install --no-cache-dir -r requirements.txt

# Copy project
COPY . /app/

EXPOSE 8000

docker-compose.yml

services:
  backend:
    build: .
    container_name: llmchat
    command: python manage.py runserver 0.0.0.0:8000
    volumes:
      - .:/app  # hot reloading, overrides .dockerignore!
    ports:
      - 8000:8000
    depends_on:
      - redis
  redis:
    image: redis:latest
    container_name: llmchat_redis
    command: redis-server /usr/local/etc/redis/redis.conf
    volumes:
      - ./redis.conf:/usr/local/etc/redis/redis.conf
    ports:
      - '6379:6379'
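The compose file mounts a redis.conf from the project root, so that file has to exist; here is a minimal sketch with placeholder directives, not the repository's actual config:

# redis.conf (minimal example, tune for real workloads)
bind 0.0.0.0
port 6379
appendonly no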

Build and run the application with the following Docker commands:

docker build -t llmchat-prod .
docker-compose build
docker-compose up
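Note that docker-compose build already builds the backend image from the same Dockerfile, so the standalone docker build -t llmchat-prod . is only needed when you want a separately tagged production image.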

3. Deploying

This application will be deployed to Railway.

Create Procfile

Install daphne, the ASGI server used in the Procfile below (gunicorn is a WSGI server and is not needed for this ASGI application)

pip install daphne

Procfile

web: daphne -b 0.0.0.0 -p 8080 config.asgi:application

Install the additional packages required by the daphne server

pip install twisted[tls,http2]

Environment variables

Django

Set DEBUG=True and SECRET_KEY in the local .env file. Also update config/settings.py with SECRET_KEY = env.str('SECRET_KEY') and DEBUG = env.bool('DEBUG', default=False), so that DEBUG falls back to False in production.
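A local .env might look like this; all values are placeholders:

# .env (local development only; values are placeholders)
DEBUG=True
SECRET_KEY=replace-me-with-a-long-random-string
OPENAI_API_KEY=sk-...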

React

1. Installing Node in the Python Docker image to recognize the host at build time

React environment variables are baked into the bundle at build time, so when npm run build runs outside of the Docker image, the bundle cannot know the production host; in that case you need the dynamic host and protocol check described in option 2 below.

To make an environment variable available during the build instead, install Node.js inside the Python image and run the React build there. The Dockerfile should therefore be extended with the Node installation and the copy/build steps:

# ...

# Set environment variables
ENV PIP_DISABLE_PIP_VERSION_CHECK=1
ENV PYTHONDONTWRITEBYTECODE=1
ENV PYTHONUNBUFFERED=1

# Install Node.js (example for Node 20.x)
RUN apt-get update && apt-get install -y curl && \
    curl -fsSL https://deb.nodesource.com/setup_20.x | bash - && \
    apt-get install -y nodejs && \
    apt-get clean && rm -rf /var/lib/apt/lists/*

# Copy frontend files and build React app
WORKDIR /app/frontend
COPY ./frontend/package*.json ./
RUN npm install
COPY ./frontend ./
RUN npm run build

# Set work directory
WORKDIR /app

# ...

With Create React App, the variable names need to start with REACT_APP_, while with Vite they should start with VITE_.
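For example, a hypothetical REACT_APP_WS_URL (or VITE_WS_URL) variable set during the Docker build would then be read in the bundle as process.env.REACT_APP_WS_URL with CRA, or import.meta.env.VITE_WS_URL with Vite.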

Note: It seems multi-stage builds don’t work in Railway. :confused:

2. Checking host and protocol dynamically without installing Node in the Python Docker image

In this case, instead of setting the environment variables in React, use the get_host() method and HTTP_X_FORWARDED_PROTO in Django.

Create a Django app named react.

python manage.py startapp react

After the app is created, don't forget to add it to INSTALLED_APPS = [ ..., "react", ] in config/settings.py.

react/views.py

# react/views.py

from django.views.generic import TemplateView

class IndexView(TemplateView):
    template_name = 'index.html'

    def get_context_data(self, **kwargs):
        context = super().get_context_data(**kwargs)
        
        # Check for forwarded protocol (for proxies like Railway)
        forwarded_proto = self.request.META.get('HTTP_X_FORWARDED_PROTO')
        is_secure = self.request.is_secure() or forwarded_proto == 'https'
        
        # Set WS/WSS protocol
        ws_protocol = 'wss://' if is_secure else 'ws://'
        context['WS_URL'] = f"{ws_protocol}{self.request.get_host()}"
        
        return context

Change the chat.views.React URL pattern to react.views.IndexView in config/urls.py:

# config/urls.py
# ...
from react.views import IndexView

urlpatterns = [
  
  # ...
  
  re_path(r'^.*$', IndexView.as_view()),

]

Also set the proxy SSL header in config/settings.py so Django recognizes HTTPS requests behind Railway's proxy:

# config/settings.py

SECURE_PROXY_SSL_HEADER = ('HTTP_X_FORWARDED_PROTO', 'https')
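With this setting, request.is_secure() returns True whenever the proxy forwards X-Forwarded-Proto: https, so the wss:// branch in IndexView is chosen in production.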

Update files on the frontend as well

Add a <script> that renders the WS_URL|safe context variable into the <head> tag of frontend/index.html, and read it as window.WS_URL in Chat.jsx:

<script> window.WS_URL = "{{ WS_URL|safe }}"; </script>

// frontend/src/Chat.jsx
const websocket = new WebSocket(window.WS_URL + '/ws/chat/');

Serving static files

# config/settings.py

# Static files ...

STATIC_ROOT = str(BASE_DIR.joinpath('staticfiles'))  # production
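For IndexView to find index.html and for collectstatic to pick up the React bundle, the build output directory also has to be on Django's template and static paths; a sketch, assuming a Vite build landing in frontend/dist (CRA uses frontend/build):

# config/settings.py (sketch; the frontend build directory name is an assumption)

STATICFILES_DIRS = [BASE_DIR / "frontend" / "dist"]
# add the same directory to TEMPLATES[0]["DIRS"] so index.html resolves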

Adding whitenoise to serve the collected static files of the project

pip install whitenoise

# config/settings.py

MIDDLEWARE = [
    # ...
    "django.middleware.security.SecurityMiddleware",
    "whitenoise.middleware.WhiteNoiseMiddleware",
    # ...
]
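Optionally, whitenoise's compressed storage backend can be enabled for compression and cache-busting file names; this is my addition using the pre-Django-4.2 setting name (newer Django versions configure it via the STORAGES dict instead):

# config/settings.py (optional, not in the original setup)

STATICFILES_STORAGE = "whitenoise.storage.CompressedManifestStaticFilesStorage"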

Collect the static files with the Django command:

python manage.py collectstatic --noinput

Freezing the requirements before deploying

pip freeze > requirements.txt

GitHub

Commit and push the files to a GitHub repository.

Configuring Railway

If any errors occur, check the build and deploy logs in Deployments > View logs!