Consult is an LLM-powered tool to automate the processing of public consultations.
> [!IMPORTANT]
> Incubation Project: This project is an incubation project; as such, we don't recommend using this for critical use cases yet. We are currently in a research stage, trialling the tool for case studies across the Civil Service. If you are a civil servant and wish to take part in our research stage, please contact us at [email protected].
This repository is a work in progress, containing a Django app to visualise and explore consultation data and LLM-generated themes.
For our core AI pipeline, which uses topic modelling to classify consultation responses into themes, please see our themefinder Python package on PyPI.
Installation instructions assume a Mac with Homebrew. You will need:

- Docker (`brew install docker`)
- GraphViz (`brew install graphviz`), for generating diagrams
- pre-commit (`brew install pre-commit`)
```
git clone [email protected]:i-dot-ai/consult.git
```
In the cloned repo, install pre-commit:
```
cd consult
pre-commit install
```
Pre-commit flags some potential secrets at commit time (but it will not catch all sensitive information).
This project includes VSCode configuration files to ensure a consistent development experience:
- `.vscode/settings.json` - Workspace settings for formatting, linting, and language support
- `.vscode/extensions.json` - Recommended extensions for the project
When you open the project in VSCode, you'll be prompted to install recommended extensions. These include:
- Python - Python language support with Poetry integration
- Ruff - Python linter and formatter
- ESLint - JavaScript/TypeScript linter
- Prettier - JavaScript/TypeScript code formatter
- Astro - Astro framework support
- Svelte - Svelte framework support
- Tailwind CSS IntelliSense - Tailwind CSS tooling
The workspace settings are configured to:
- Format code on save (using appropriate formatter per language)
- Auto-fix ESLint issues on save
- Enable TypeScript support in Svelte files
You can override these settings in your User Settings if you prefer different personal configurations. See the VSCode settings documentation for more information on the settings hierarchy.
Populate `.env` by copying `.env.test` (e.g. `cp .env.test .env`) and filling in the required values.
Ensure you have Python 3.12.3, Poetry, and npm installed. Then run `poetry install` and `npm install`.
```
docker compose up -d postgres
```
This runs PostgreSQL locally.
```
make dev_environment
```
This will set up dev and test databases with dummy data; see the definition of that make task for the various steps. It also sets up an admin account for the dev environment: a staff user (i.e. one that can access the admin) with the username [email protected] and the password admin.
Confirm everything is working with:
```
make check_db
```
(You can see all the available make commands by running bare `make` or `make help`.)
Make sure Redis is running:
```
docker compose up -d redis
```
The database should also be running as described above.
Then run:
```
make serve
```
Go to http://localhost:8000 in your browser.
```
make test
```
Only run this in development. It will create a consultation with 100 complete responses in a variety of question formats. This runs as part of `make dev_environment`, but you can run it more than once:
```
make dummy_data
```
Alternatively, go to /support/consultations/ and generate a dummy consultation from there.
If you use the `make migrate` command to run migrations, the diagram below will be regenerated automatically. If you need to generate it outside that process, you can run `manage.py generate_erd`. (You will need GraphViz installed: see the pydot docs.)
You can sign in to the application using a magic link, requested via /sign-in/. You need to have a user set up first: add new users at /support/users/ (this can only be done by staff members). When running locally, you can create your first admin user using `make dev_admin_user`; on dev/pre-prod/prod, ask one of the existing members of the team. For convenience, in local dev environments the value of the magic link will be logged along with the rest of the server logs.
We depend on govuk-frontend for GOV.UK Design System styles. django-compressor should work automatically, compiling the govuk-frontend SCSS on the first request and on any subsequent request after the SCSS has changed. In the meantime it will read from frontend/CACHE, which is .gitignored. In production, we prepopulate frontend/CACHE using `manage.py compress`, which means every request is served from the cache. django-compressor also takes care of fingerprinting and setting cache headers for our CSS.
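For illustration only, a minimal sketch of the kind of django-compressor settings involved; this is not this repo's actual configuration, and `django_libsass.SassCompiler` is an assumption about which SCSS precompiler is used:

```python
# settings.py (illustrative sketch only, not this repo's actual config)
COMPRESS_PRECOMPILERS = (
    # Assumption: django-libsass compiles the govuk-frontend SCSS
    ("text/x-scss", "django_libsass.SassCompiler"),
)
# In production, `manage.py compress` prepopulates frontend/CACHE ahead of time
# (offline compression), so requests never trigger compilation.
COMPRESS_OFFLINE = True
```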
If you have made changes to the Lit components, run `npm run build-lit` to see the changes.
The govuk assets are versioned in the npm package. `make dev_environment` includes a step to copy them to the frontend folder, from where runserver can serve them; you can rerun this with `make govuk_frontend`.
We are using adr-tools to manage "Architectural Decision Records", to track decisions made. To install and use it, see https://github.com/npryce/adr-tools.
The support area is for admin use: adding users, running imports, giving users permissions to consultations, etc.
Access the support area by going to /support/. You will need to be a "staff user" to access it.
If you are running locally, you can create a staff user by running make dev_admin_user - which creates an admin user (as described above).
On any environment, if you are a staff user, you can give other users permission to access the support area. Go to /support/users/.
Data should be stored in the appropriate S3 bucket (`AWS_DATA_BUCKET`) and within a folder called `app_data/consultations/`.
It should be stored in the following structure for a given consultation:
```
<consultation-name>/
├── raw_data/
│   └── ....
├── inputs/
│   ├── question_part_<id>/
│   │   ├── responses.jsonl
│   │   ├── multi_choice.jsonl
│   │   └── question.json
│   ├── question_part_<id>/
│   │   ├── responses.jsonl
│   │   └── question.json
│   ├── ...
│   └── respondents.jsonl
└── outputs/
    ├── mapping/
    │   ├── <timestamp>/
    │   │   ├── question_part_<id>/
    │   │   │   ├── detail_detection.jsonl
    │   │   │   ├── mapping.jsonl
    │   │   │   ├── sentiment.jsonl
    │   │   │   └── themes.json
    │   │   ├── question_part_<id>/
    │   │   ├── ...
    │   ...
    └── sign_off/
```
Note that the notion of a "question part" reflects historic notation; a question part represents one question.
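As an illustration of this layout, a hypothetical boto3 sketch for uploading one question's input files; the bucket name, consultation name, and local paths are placeholders, not values from this repo:

```python
# Hypothetical upload sketch mirroring the layout above; not a script from this repo.
import boto3

s3 = boto3.client("s3")
bucket = "my-consult-data-bucket"  # placeholder: use the bucket named in AWS_DATA_BUCKET
prefix = "app_data/consultations/my-consultation/inputs/question_part_1/"

for filename in ("responses.jsonl", "question.json"):
    s3.upload_file(f"local_data/{filename}", bucket, prefix + filename)
```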
The format for each of these files is in `consultation_analyser/consultations/import_schema`. Some of the files are JSONL (JSON Lines) files. The schemas are given in JSON Schema format. In Python you can use the jsonschema library to validate a JSON instance.
Format of each of the files:
- `respondents.jsonl` - a JSONL file per consultation, where each entry is in the format given in the `respondent.json` schema.
- `responses.jsonl` - a JSONL file per question, where each entry is in the format given in the `response.json` schema.
- `multichoice.jsonl` - a JSONL file per question (only if the question has a multiple choice part), where each entry is in the format given in the `response.json` schema.
- `question.json` - a JSON file per question, which must satisfy the format given in the `question.json` schema.
- `detail_detection.jsonl` - a JSONL file per question and run of `themefinder`; each row is a line in the format of `detail_detection.json`, with one row per response.
- `mapping.jsonl` - a JSONL file per question and run of `themefinder`; each row is a line in the format of `mapping.json`, mapping a given response to its themes.
- `sentiment.jsonl` - a JSONL file per question part and run of `themefinder`; each row is a line in the format of `sentiment.json`, with one row per response.
- `themes.json` - gives the themes for a given question part and run of `themefinder`, with `theme_key` as a unique identifier for a theme (for a given question). This is the format given by the `themes.json` schema.
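For example, a minimal sketch of validating a JSONL file against one of these schemas with the jsonschema library; the file paths are placeholders for your local copies:

```python
# Minimal validation sketch; the paths are placeholders for your local copies.
import json
from jsonschema import validate

with open("consultation_analyser/consultations/import_schema/respondent.json") as f:
    schema = json.load(f)

with open("respondents.jsonl") as f:
    for line in f:
        validate(instance=json.loads(line), schema=schema)  # raises ValidationError on a bad row
```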
You must be a staff user to import consultation data. The consultation data consists of input data on questions, responses, and respondent data, and output data which is further data on themes from themefinder.
Data should be stored in a specific structure in S3 (bucket specified by `AWS_BUCKET_NAME`) as described above.
Run this locally first to ensure the data is valid before running in production.
The import should be run in stages, which can be navigated to from support/consultations/import-summary/:
- Create a consultation and import data on respondents at support/consultations/import-consultation.
- The import runs asynchronously; you can check its progress by looking at the queue in support/django-rq/ (or from a shell, as sketched below).
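If you prefer to inspect the queue from a Django shell, a hypothetical sketch using django-rq; the queue name "default" is an assumption:

```python
# Hypothetical queue inspection from `python manage.py shell`;
# assumes the import jobs run on the default django-rq queue.
from django_rq import get_queue

queue = get_queue("default")
for job in queue.jobs:
    print(job.id, job.func_name, job.get_status())
```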
If the import fails half-way, delete the consultation or question (which will delete all related objects) and re-import. This can be done by navigating to the individual consultation from /support/consultations/.
To run this locally, you must have access to your AWS account.
```
make test
```
Run the command below to run all unit tests for front-end components, using Storybook as the test runner:
```
npm run storybook-test
```
Each component also displays its test cases inside the "Component tests" panel when Storybook is viewed in the browser.
End-to-end tests are located in the /e2e_tests folder and use Playwright to test the full application flow.
To run e2e tests:
```
make test-end-to-end
```
Or run with the UI:
```
npm run e2e-ui
```
The e2e tests run against docker-compose services and test complete user journeys including authentication, navigation, and form submissions.
