LangChain + RAG Cheat Sheet
Introduction to LangChain: Building Efficient LLM Applications
LangChain is a powerful ecosystem designed to simplify the development of applications based on Large Language Models (LLMs). Whether you're an experienced developer or new to artificial intelligence, LangChain offers tools and predefined chains to streamline your workflow. In this article, we'll explore the main features of LangChain, accompanied by code examples to help you get started.
1. Setting Up Hugging Face
Before diving in, make sure you have an account on Hugging Face. You'll need an API access token to interact with hosted models. Here's how to set it up:
1. Create an account on Hugging Face if you haven't already.
2. Generate an API access token from your settings page.
3. Keep your token secure; you'll need it for API calls.

Note: Never share your access token publicly.
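Rather than hard-coding the token in your scripts, you can export it as the HUGGINGFACEHUB_API_TOKEN environment variable, which LangChain's Hugging Face integrations pick up automatically; a minimal sketch:

```python
import os

# Set the variable in your shell first, e.g.:
#   export HUGGINGFACEHUB_API_TOKEN=hf_your_access_token
huggingfacehub_api_token = os.environ["HUGGINGFACEHUB_API_TOKEN"]
```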
2. Basic API Call to Hugging Face
Let's use an LLM hosted on Hugging Face for a simple prediction:
```python
from langchain.llms import HuggingFaceEndpoint

# Replace with your API access token
huggingfacehub_api_token = 'hf_your_access_token'

# Define the LLM
llm = HuggingFaceEndpoint(
    endpoint_url='https://api-inference.huggingface.co/models/tiiuae/falcon-7b-instruct',
    huggingfacehub_api_token=huggingfacehub_api_token
)

# Ask a question to the model
question = 'What can I do to improve my productivity?'
output = llm.invoke(question)
print(output)
```
3. Using Prompt Templates
Prompt templates allow you to structure your queries flexibly. Here's how to create a simple template:
```python
from langchain.prompts import PromptTemplate
from langchain.llms import HuggingFaceEndpoint

# Create a prompt template
template = "You are an artificial intelligence assistant. Answer the following question: {question}"
prompt = PromptTemplate(template=template, input_variables=["question"])

# Integrate the template with the LLM
llm = HuggingFaceEndpoint(
    endpoint_url='https://api-inference.huggingface.co/models/tiiuae/falcon-7b-instruct',
    huggingfacehub_api_token=huggingfacehub_api_token
)
llm_chain = prompt | llm

question = "How does LangChain simplify LLM application development?"
print(llm_chain.invoke({"question": question}))
```
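The same pattern extends to chat models via ChatPromptTemplate (used again in section 10), which takes a list of (role, message) pairs instead of a single string; a minimal sketch:

```python
from langchain.prompts import ChatPromptTemplate

# Chat-style template: a list of (role, message) pairs
chat_prompt = ChatPromptTemplate.from_messages([
    ("system", "You are an artificial intelligence assistant."),
    ("human", "Answer the following question: {question}")
])
print(chat_prompt.format_messages(question="What is LangChain?"))
```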
4. Managing Memory in Chat Models
LangChain offers several ways to manage conversation history:
a. ChatMessageHistory
This class stores all messages exchanged during a conversation.
```python
from langchain.memory import ChatMessageHistory
from langchain.chat_models import ChatOpenAI

llm = ChatOpenAI(model_name="gpt-4", temperature=0)

# Create the conversation history
history = ChatMessageHistory()
history.add_ai_message("Hello! Ask me any question about Python programming.")
history.add_user_message("What is a list comprehension in Python?")

# Pass the accumulated messages to the model
response = llm.invoke(history.messages)
print(response.content)
```
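Since the history object keeps every message, you can append the model's reply and a follow-up question, then call the model again with the full context; a minimal sketch continuing the exchange above:

```python
# Append the model's answer, then ask a follow-up with full context
history.add_ai_message(response.content)
history.add_user_message("Can you show a short example?")

followup = llm.invoke(history.messages)
print(followup.content)
```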
b. ConversationBufferWindowMemory
This class stores only a defined number of recent exchanges; its simpler sibling, ConversationBufferMemory, keeps the entire conversation.
```python
from langchain.memory import ConversationBufferWindowMemory
from langchain.chains import ConversationChain
from langchain.chat_models import ChatOpenAI

llm = ChatOpenAI(model_name="gpt-4", temperature=0)

# Define the window memory (keeps only the last k=4 exchanges)
memory = ConversationBufferWindowMemory(k=4)

# Create the conversation chain
buffer_chain = ConversationChain(llm=llm, memory=memory)

# Interact with the model
buffer_chain.predict(input="Explain decorators in Python.")
buffer_chain.predict(input="Can you give an example with @staticmethod?")
```
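To check what the window currently holds, read the memory back; load_memory_variables is part of LangChain's memory interface:

```python
# Inspect the buffered conversation (returns a dict with the stored history)
print(memory.load_memory_variables({}))
```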
5. Sequential Chains
Sequential chains allow you to link multiple processing steps. For example, creating a learning plan:
```python
from langchain.prompts import PromptTemplate
from langchain.chains import LLMChain, SequentialChain
from langchain.chat_models import ChatOpenAI

llm = ChatOpenAI(model_name="gpt-4", temperature=0)

# Template for the activity
learning_prompt = PromptTemplate(
    input_variables=["activity"],
    template="I want to learn how to {activity}. Can you suggest steps to achieve this?"
)

# Template for the time constraint
time_prompt = PromptTemplate(
    input_variables=["learning_plan"],
    template="I only have one week. Can you create a plan to reach this goal: {learning_plan}."
)

# Wrap each prompt in an LLMChain so SequentialChain can run them in order
learning_chain = LLMChain(llm=llm, prompt=learning_prompt, output_key="learning_plan")
time_chain = LLMChain(llm=llm, prompt=time_prompt, output_key="one_week_plan")

chain = SequentialChain(
    chains=[learning_chain, time_chain],
    input_variables=["activity"],
    output_variables=["one_week_plan"]
)

# Execute the chain
output = chain.invoke({"activity": "play the piano"})
print(output)
```
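The same two-step pipeline can also be written with the pipe syntax from section 3, feeding the first step's output into the second prompt; a minimal LCEL sketch:

```python
from langchain.schema.output_parser import StrOutputParser

# The dict step runs the first chain and binds its answer to {learning_plan}
lcel_chain = (
    {"learning_plan": learning_prompt | llm | StrOutputParser()}
    | time_prompt
    | llm
    | StrOutputParser()
)
print(lcel_chain.invoke({"activity": "play the piano"}))
```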
6. Agents
Agents make decisions based on the tools available to them. LangChain provides pre-built agents like the ReAct agent:
```python
from langchain.agents import load_tools, initialize_agent
from langchain.chat_models import ChatOpenAI

# Load tools
tools = load_tools(["wikipedia"])

# Define the LLM
llm = ChatOpenAI(model_name="gpt-4", temperature=0)

# Create the agent
agent = initialize_agent(tools, llm, agent="zero-shot-react-description", verbose=True)

# Use the agent
response = agent.run("Summarize key facts about London, England.")
print(response)
```
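load_tools accepts several tool names at once; some bundled tools, such as the "llm-math" calculator, also need the LLM passed in. A small sketch:

```python
# Give the agent a search tool and a calculator ("llm-math" requires the LLM)
tools = load_tools(["wikipedia", "llm-math"], llm=llm)
agent = initialize_agent(tools, llm, agent="zero-shot-react-description", verbose=True)
print(agent.run("What is the current population of London multiplied by two?"))
```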
7. Creating Custom Tools for Agents
You can create your own tools to extend the capabilities of agents.
```python
from langchain.agents import tool, initialize_agent

# Example tool function
@tool
def retrieve_customer_info(name: str) -> str:
    """Retrieve customer information based on their name."""
    # Simulate a database
    customers = {
        "Peak Performance Co.": "Information about Peak Performance Co...",
        "Innovatech Ltd.": "Information about Innovatech Ltd..."
    }
    return customers.get(name, "Customer not found.")

# Create the agent with the custom tool (llm as defined above)
agent = initialize_agent([retrieve_customer_info], llm, agent="zero-shot-react-description", verbose=True)

# Use the agent
response = agent.run("Create a summary for our customer: Peak Performance Co.")
print(response)
```
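A decorated tool can also be invoked directly, which makes it easy to test outside the agent loop:

```python
# Call the tool directly to test it (bypasses the agent)
print(retrieve_customer_info.run("Peak Performance Co."))
print(retrieve_customer_info.run("Unknown Corp."))  # -> "Customer not found."
```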
8. Integrating Document Loaders
Document loaders allow you to import various types of data into your application.
a. Loading PDFs
```python
from langchain.document_loaders import PyPDFLoader

# Load the PDF document
loader = PyPDFLoader("rag_vs_fine_tuning.pdf")
data = loader.load()
print(data[0])
```
b. Loading CSVs
```python
from langchain.document_loaders import CSVLoader

# Load the CSV file
loader = CSVLoader("fifa_countries_audience.csv")
data = loader.load()
print(data[0])
```
c. Loading HTML
```python
from langchain.document_loaders import UnstructuredHTMLLoader

# Load the HTML file
loader = UnstructuredHTMLLoader("white_house_executive_order_nov_2023.html")
data = loader.load()
print(data[0])
```
9. Splitting Data for Retrieval
Splitting documents into smaller chunks facilitates data management and information retrieval.
a. CharacterTextSplitter
```python
from langchain.text_splitter import CharacterTextSplitter

text = 'Words are flowing out like endless rain into a paper cup,\nthey slither while they pass,\nthey slip away across the universe.'
chunk_size = 24
chunk_overlap = 10

# Create an instance of the splitter
splitter = CharacterTextSplitter(
    separator="\n",
    chunk_size=chunk_size,
    chunk_overlap=chunk_overlap
)

# Split the text and print the chunks
chunks = splitter.split_text(text)
print(chunks)
print([len(chunk) for chunk in chunks])
```
b. RecursiveCharacterTextSplitter
CharacterTextSplitter only splits on a single separator, so it can produce chunks longer than chunk_size. RecursiveCharacterTextSplitter instead tries each separator in turn until the chunks fit.
```python
from langchain.text_splitter import RecursiveCharacterTextSplitter

text = 'Words are flowing out like endless rain into a paper cup,\nthey slither while they pass,\nthey slip away across the universe.'
chunk_size = 24
chunk_overlap = 10

# Create an instance of the splitter
splitter = RecursiveCharacterTextSplitter(
    separators=["\n", " ", ""],
    chunk_size=chunk_size,
    chunk_overlap=chunk_overlap
)

# Split the text and print the chunks
chunks = splitter.split_text(text)
print(chunks)
print([len(chunk) for chunk in chunks])
```
c. Splitting an HTML Document
```python
from langchain.document_loaders import UnstructuredHTMLLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter

# Load the HTML document
loader = UnstructuredHTMLLoader("white_house_executive_order_nov_2023.html")
data = loader.load()

chunk_size = 300
chunk_overlap = 100

# Split the HTML (note: separators expects a list)
splitter = RecursiveCharacterTextSplitter(
    chunk_size=chunk_size,
    chunk_overlap=chunk_overlap,
    separators=["."]
)

docs = splitter.split_documents(data)
print(docs)
```
10. RAG Storage and Retrieval Using a Vector Database
Retrieval-Augmented Generation (RAG) improves answer accuracy by retrieving relevant passages from an external knowledge base and passing them to the model as context.
a. Using ChromaDB
```python
import os
from langchain.document_loaders import PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.vectorstores import Chroma
from langchain.embeddings import OpenAIEmbeddings
from langchain.prompts import ChatPromptTemplate
from langchain.schema.runnable import RunnableMap, RunnablePassthrough
from langchain.chat_models import ChatOpenAI

# Load and split your documents
loader = PyPDFLoader('rag_vs_fine_tuning.pdf')
data = loader.load()
splitter = RecursiveCharacterTextSplitter(chunk_size=300, chunk_overlap=50)
docs = splitter.split_documents(data)

# Create the vector database
embedding_function = OpenAIEmbeddings(openai_api_key='your_openai_api_key')
vectorstore = Chroma.from_documents(
    docs,
    embedding=embedding_function,
    persist_directory=os.getcwd()
)

# Configure the retriever
retriever = vectorstore.as_retriever(
    search_type="similarity",
    search_kwargs={"k": 3}
)

# Create the prompt template
message = """
Answer the following question using the context provided:

Context:
{context}

Question:
{question}

Answer:
"""
prompt_template = ChatPromptTemplate.from_messages([("human", message)])

# Define the LLM
llm = ChatOpenAI(model_name="gpt-4", temperature=0)

# Create the RAG chain: fetch context, pass the question through, then call the LLM
rag_chain = RunnableMap({
    "context": retriever,
    "question": RunnablePassthrough()
}) | prompt_template | llm

# Execute the chain
response = rag_chain.invoke("Which popular LLMs were considered in the paper?")
print(response.content)
```
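If an answer looks off, it is worth checking what the retriever actually returns before blaming the model; a quick sanity check:

```python
# Print the top-3 retrieved chunks for the question
hits = retriever.get_relevant_documents("Which popular LLMs were considered in the paper?")
for hit in hits:
    print(hit.page_content[:150])
```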
Conclusion
LangChain makes it simple and efficient to build complex LLM applications. By combining prompt templates, memory management, custom agents, and external data integration, you can create intelligent solutions tailored to your specific needs.
Start today and unlock the full potential of language models with LangChain!