The release of ChatGPT sparked excitement not just among tech enthusiasts but also among the general public. Large language models (LLMs) offer exceptional performance across a wide variety of tasks. Some LLMs have billions of parameters, far more than most other NLP models, which allows them to learn complex patterns in language and perform tasks that would be impossible for smaller models.
LLaMa-2 is a family of large language models (LLMs) developed by Meta AI. It is the successor to the original LLaMa model, which was released in early 2023. With variants ranging in scale from 7B to 70B parameters, LLaMa-2 is one of the largest LLMs publicly available. It is trained on a massive dataset of text and code, which allows it to learn complex patterns in language and perform a wide range of tasks, including text translation, text summarization, question answering, code generation, and creative writing.
LLaMa-2 outperformed state-of-the-art open-source models such as Falcon and MPT on various benchmarks, including MMLU, TriviaQA, Natural Questions, HumanEval, and others.
LLaMa-2 is designed to be more efficient and accessible than other LLMs. It is trained on roughly 2 trillion tokens, about 40% more data than the original LLaMa, and the 70B variant uses grouped-query attention to speed up inference. LLaMa-2 is optimized to run on a variety of hardware platforms, including GPUs, CPUs, and even mobile devices. Its weights are openly available under Meta's community license, which permits both research and commercial use. This makes it a valuable resource for researchers and developers who are working on new applications for LLMs.
Key Benefits of Using LLaMa-2
- High Accuracy and Fluency: LLaMa-2 is trained on a massive dataset of text and code, which allows it to generate text that is both accurate and fluent.
- Efficiency and Accessibility: LLaMa-2 is designed to be more efficient and accessible than other LLMs. It can be trained on larger datasets and run on a variety of hardware platforms.
- Openly Available: LLaMa-2's weights are released under Meta's community license, which permits both research and commercial use.
LLaMa-2 is a powerful new tool for natural language processing. It has the potential to be used in a wide range of applications, such as machine translation, text summarization, question answering, code generation, and creative writing.
Methods of Using LLaMa-2
There are two methods of using LLaMa-2:
- Download the model weights directly from Meta, using the instructions and download link emailed to you once your access request is approved (the hard way, and practical only if you have a decent GPU), or
- Use Hugging Face and a cloud GPU notebook. (E2E Networks gives you a robust option.)
In this article, we will go through the second method, that is, using the Hugging Face libraries in an E2E TIR Notebook.
To use LLaMA-2 in a TIR Notebook, we first need to install the necessary packages:
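The packages listed later in this article (transformers, einops, accelerate, langchain, and bitsandbytes) can be installed in one command; unpinned versions are used here, so the latest releases will be pulled:

```shell
pip install -q transformers einops accelerate langchain bitsandbytes
```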
Then we need to log in to Hugging Face:
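The login step uses the Hugging Face CLI, which ships with the packages installed above:

```shell
# Prompts for the access token generated under
# Settings -> Access Tokens on huggingface.co
huggingface-cli login
```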
Generate an access token from your Hugging Face account settings (it is free to create one); once logged in, we can download the LLaMA-2 model.
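A sketch of the download and pipeline-creation steps described below, assuming your Hugging Face account has been granted access to the gated meta-llama repository and a GPU is available (the generation settings such as `max_length` and `temperature` are illustrative choices, not requirements):

```python
import torch
import transformers
from transformers import AutoTokenizer
from langchain.llms import HuggingFacePipeline

# Download the LLaMA-2 chat model and its tokenizer from the Hub
model = "meta-llama/Llama-2-7b-chat-hf"
tokenizer = AutoTokenizer.from_pretrained(model)

# Build a text-generation pipeline; device_map="auto" places the
# model on the available GPU(s)
pipeline = transformers.pipeline(
    "text-generation",
    model=model,
    tokenizer=tokenizer,
    torch_dtype=torch.float16,
    device_map="auto",
    max_length=1000,
)

# Wrap the pipeline so LangChain can use it as an LLM
llm = HuggingFacePipeline(pipeline=pipeline, model_kwargs={"temperature": 0})
```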
Text Summarization
Text summarization is a crucial task in natural language processing (NLP) that extracts the most important information from a text while retaining its core meaning. In recent years, various techniques and models have been developed to automate this process, making it easier to digest large volumes of text data.
This article proposes a solution for text summarization using LLaMA-2 locally, without using cloud services or exposing your documents to third-party applications or OpenAI's models. We will explore the capabilities of LLaMA-2 and demonstrate how it can streamline your multiple document summarization needs.
This HuggingFacePipeline will now allow us to use the LLaMA-2 model in our notebook.
Here is a brief explanation of each step:
- Installing the necessary packages: The pip install command installs the Python packages that we need to use LLaMA-2. These packages include transformers, einops, accelerate, langchain, and bitsandbytes.
- Logging in to Hugging Face: The huggingface-cli login command logs us in to Hugging Face. This is necessary because we need to download the LLaMA-2 model from the Hugging Face Hub.
- Downloading the LLaMA-2 model: The code in this section downloads the LLaMA-2 model from the Hugging Face Hub. We specify the model name (model = "meta-llama/Llama-2-7b-chat-hf") and the tokenizer (tokenizer = AutoTokenizer.from_pretrained(model)).
- Creating a HuggingFacePipeline out of the model: The code in this section creates a HuggingFacePipeline out of the LLaMA-2 model. This Pipeline will allow us to use the model in our notebook.
Once we have created the HuggingFacePipeline, we can start using the LLaMA-2 model.
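Summarization follows the same prompt-and-chain pattern used for translation later in this article; the exact prompt wording below is one reasonable choice, not the only one:

```python
from langchain import PromptTemplate, LLMChain

# Prompt asking the model for a bullet-point summary
template = """Write a concise bullet-point summary of the following text:
{text}
SUMMARY:
"""
prompt = PromptTemplate(template=template, input_variables=["text"])

# llm is the HuggingFacePipeline wrapping the LLaMA-2 model
llm_chain = LLMChain(prompt=prompt, llm=llm)

text = "..."  # the document you want summarized
print(llm_chain.run(text))
```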
You can pass your text inside the function and generate a summary of your own. For example, see below.
Given text:
"""Arsenal take on Stoke City in Barclays Under 21 Premier League clash .
Jack Wilshere and club captain Mikel Arteta have been out since November .
Abou Diaby has been ravaged by injuries during nine-year spell at club .
Arteta, Wilshere and Diaby are all close to first-team returns .
Young winger Serge Gnabry also in the side on return from injury .
READ: Arsenal's Alex Oxlade-Chamberlain, Calum Chambers, Jack Wilshere and Danny Welbeck keep their agents close."""
Generated output:
• Arsenal take on Stoke City in Barclays Under 21 Premier League clash.
• Jack Wilshere and club captain Mikel Arteta have been out since November.
• Abou Diaby has been ravaged by injuries during his nine-year spell at the club.
• Arteta, Wilshere, and Diaby are all close to first-team returns.
• Young winger Serge Gnabry is also in the side on his return from injury.
• Alex Oxlade-Chamberlain, Calum Chambers, Jack Wilshere, and Danny Welbeck keep their agents close.
Text Translation
Similarly, we can change the prompt as follows to generate an output for text translation.
from langchain import PromptTemplate, LLMChain

template = """Translate the following sentence from English to French:
```{text}```
TRANSLATED SENTENCE:
"""
prompt = PromptTemplate(template=template, input_variables=["text"])
llm_chain = LLMChain(prompt=prompt, llm=llm)  # llm is the HuggingFacePipeline created earlier
print(llm_chain.run(text))
For example, see below.
Given text:
The quick brown fox jumps over the lazy dog.
Generated output:
``` Le renard brun rapide saute sur le chien endormi.```
Explanation:
* "The quick brown fox" becomes "Le renard brun rapide" in French, where "renard" means "fox" and "brun" means "brown".
* "jumps" becomes "saute" in French, which means "jumps" in English.
* "over" becomes "sur" in French, which means "over" in English.
* "the lazy dog" becomes "le chien endormi" in French, where "chien" means "dog" and "endormi" means "lazy".
So the translated sentence is "Le renard brun rapide saute sur le chien endormi".
Conclusion
In this article, we have explored the capabilities of LLaMA-2 and demonstrated how it can be used for text summarization locally. We have also discussed the benefits of using LLaMA-2 for this task, such as its accuracy, speed, and ease of use.
LLaMA-2 is a powerful tool that can help individuals and businesses streamline their multiple document summarization needs. It is well suited to tasks such as summarizing news articles, research papers, and other types of documents.
LLaMA-2 has the potential to revolutionize the way we interact with text data. By making it possible to summarize text quickly and accurately, LLaMA-2 can help us to make better decisions and be more productive.