Integrate Oracle AI Vector DB and OCI GenAI with Llama-index and LangChain

The UI of the Knowledge Assistant you can build using following examples.

This repository contains all the work done on the development of RAG applications using:

Oracle AI Vector Search
Oracle OCI GenAI Service
Oracle OCI Embeddings
Cohere Reranking
Reranker models deployed in OCI Data Science
OCI ADS 2.10.0 (with support for OCI GenAI)
llama-index
LangChain

In the Video demos section of the Wiki you'll find some video of the demo.

What is RAG?

A very good introduction to what Retrieval Augmented Generation (RAG) is can be found here

Features

basic (12/2023) integration between Oracle DB Vector Store (23c) and llama-index
All documents stored in an Oracle DB
Oracle AI Vector Search used for Semantic Search
Reranking to improve retrieval
How to show references (documents used for the response generation)
(30/12) Added reranker implemented as OCI Model Deployment
(20/01) Added implementation of Vector Store for LangChain and demo
Finding duplicates in the documentation
(2/03/2024) Added Phoenix Traces for observability

Demos

demo1 This NB shows a demo where you get answers to questions on Oracle Database and new features in 23c
demo2 This NB shows a QA demo on Medicine (Covid19), using Public docs from NIH.
Bot powered by Oracle Vector DB and OCI GenAI
demo3 shows how to add a Reranker to the RAG chain; I have used Cohere
demo5 shows a full rag chain where the reranker is deployed on OCI DS
LangChain demo based on Oracle Vector DB and LangChain
finding duplicates
Knowledge assistant full demo

Setup

See the wiki pages.

Loading data

You can use create_save_embeddings to load all the data in the Oracle DB schema.
You can launch it using the script load_books.
The list of files to be loaded is specified in the file config.py

You need to have pdf files in the same directory.

Limited Availability

(News) OCI GenAI is General Availability since 23/01/2024, see updated docs for setup

Oracle AI Vector Search (Vector DB) is a new feature in Oracle DB 23c, in Limited Availability.

Customers can easily enter in the LA programs.

To test these functionalities you need to enroll in the LA programs and install the proper versions of software libraries.

Code and functionalities can change, as a result of feedbacks from customers.

Releases used for the demo

OCI 2.119.1
OCI ADS 2.10.0
LangChain >= 0.1.4
LangChain Community >= 0.0.16
Llama-index >= 0.9.37.post1 < 0.10
Oracle Database 23c (23.4) free edition with Vector DB

You can install a complete Python environment using the instructions in the *Setup section of the Wiki.

Libraries

OCI Python SDK
OCI ADS
oracledb
Streamlit
Llama-index
LangChain
Arize-AI/phoenix for Observability and Tracing

Documentation

OCI GenAI in LangChain

Embeddings

One of the key pieces in RAG is the Retrieval module. To use the DB Vector Store you need an Embeddings Model: a model that does the magic of transforming text in vectors, capturing the content and the semantic of the text. The Embeddings Model used in these demos is Cohere Embeds V3, provided from OCI GenAI service.

With few changes you can switch to use an Open Source model. But you need to have the computational power to run it.

Observability

(02/03/2024) I have added integration with Arize Phoenix (Phoenix traces). To enable tracing in config.py you must set ADD_PHX_TRACING = True

In case of troubles with installing OpenTelemetry a quick solution is to disable it.

Factory methods

In the module prepare_chain are defined factory methods to create: embedder, llm, reranker...

The philosophy is to make things simpler. So all the configuration are taken from config.py.

If you want to change the llm, (or anything else) go to config.py. No params in the NBs

Name		Name	Last commit message	Last commit date
Latest commit History 160 Commits
LICENSE		LICENSE
README.md		README.md
Using OracleVectorDB with Llama-index.docx		Using OracleVectorDB with Llama-index.docx
add_another_book.ipynb		add_another_book.ipynb
config.py		config.py
create_save_embeddings.py		create_save_embeddings.py
create_tables.sql		create_tables.sql
custom_vector_store_demo1.ipynb		custom_vector_store_demo1.ipynb
custom_vector_store_demo2.ipynb		custom_vector_store_demo2.ipynb
custom_vector_store_demo3.ipynb		custom_vector_store_demo3.ipynb
custom_vector_store_demo4.ipynb		custom_vector_store_demo4.ipynb
custom_vector_store_demo6.ipynb		custom_vector_store_demo6.ipynb
custom_vector_store_rag_chain_demo5.ipynb		custom_vector_store_rag_chain_demo5.ipynb
database-concepts.pdf		database-concepts.pdf
demo_langchain1_add_book.ipynb		demo_langchain1_add_book.ipynb
demo_langchain2.ipynb		demo_langchain2.ipynb
deploy_reranker.ipynb		deploy_reranker.ipynb
evaluate_reranker.csv		evaluate_reranker.csv
evaluate_retrieval.ipynb		evaluate_retrieval.ipynb
find_duplicates.ipynb		find_duplicates.ipynb
knowledge_assistant_on_db_short.mp4		knowledge_assistant_on_db_short.mp4
load_books.sh		load_books.sh
oci_baai_reranker.py		oci_baai_reranker.py
oci_baai_reranker_general.py		oci_baai_reranker_general.py
oci_llama_reranker.py		oci_llama_reranker.py
oci_translator.py		oci_translator.py
oci_utils.py		oci_utils.py
oracle-database-23c-new-features-guide.pdf		oracle-database-23c-new-features-guide.pdf
oracle_bot.py		oracle_bot.py
oracle_chat_with_memory.py		oracle_chat_with_memory.py
oracle_vector_db.py		oracle_vector_db.py
oracle_vector_db_lc.py		oracle_vector_db_lc.py
prepare_chain.py		prepare_chain.py
prepare_chain_4_chat.py		prepare_chain_4_chat.py
rag_chain_demo5.ipynb		rag_chain_demo5.ipynb
requirements.txt		requirements.txt
run_oracle_bot.sh		run_oracle_bot.sh
run_oracle_chat_with_memory.sh		run_oracle_chat_with_memory.sh
screenshot.png		screenshot.png
test_chain_for_bot.ipynb		test_chain_for_bot.ipynb
test_chat.ipynb		test_chat.ipynb
test_chat_engine.ipynb		test_chat_engine.ipynb
test_chunks.ipynb		test_chunks.ipynb
test_embeddings.ipynb		test_embeddings.ipynb
test_metadata.ipynb		test_metadata.ipynb
test_oci_reranker.ipynb		test_oci_reranker.ipynb
test_register_books.ipynb		test_register_books.ipynb
test_translation.ipynb		test_translation.ipynb
~$ing OracleVectorDB with Llama-index.docx		~$ing OracleVectorDB with Llama-index.docx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Integrate Oracle AI Vector DB and OCI GenAI with Llama-index and LangChain

The UI of the Knowledge Assistant you can build using following examples.

What is RAG?

Features

Demos

Setup

Loading data

Limited Availability

Releases used for the demo

Libraries

Documentation

Embeddings

Observability

Factory methods

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

luigisaetta/llamaindex_oracle

Folders and files

Latest commit

History

Repository files navigation

Integrate Oracle AI Vector DB and OCI GenAI with Llama-index and LangChain

The UI of the Knowledge Assistant you can build using following examples.

What is RAG?

Features

Demos

Setup

Loading data

Limited Availability

Releases used for the demo

Libraries

Documentation

Embeddings

Observability

Factory methods

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages