49 questions
4 votes · 2 answers · 752 views
No module named 'llama_models.cli.model' error while downloading Llama 3.1 8B
I'm trying to install the Llama 3.1 8B model by following the instructions in the llama-models GitHub README. When I run the command:
llama-model download --source meta --model-id CHOSEN_MODEL_ID
(...
0 votes · 0 answers · 55 views
Running Ollama on a local computer and prompting from a Jupyter notebook - does the model recall prior prompts as if it were the same chat?
I am running some tests using Ollama on my local computer with Llama 3.2, which consist of prompting a task against a document.
I read that after reaching the maximum context, I should restart the ...
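A point worth making for questions like this one: Ollama's chat API is stateless, so the model only "remembers" the messages you resend with each request. A minimal sketch of keeping the history yourself (`call_model` is a hypothetical stand-in for something like `ollama.chat(model=..., messages=...)`):

```python
# Minimal sketch: Ollama's chat endpoint keeps no state between calls, so
# "memory" comes only from resending the full message history every time.
# `call_model` is a hypothetical stand-in for e.g. ollama.chat(...).

def make_chat(call_model):
    history = []  # grows with every turn; resent in full on each call

    def ask(user_text):
        history.append({"role": "user", "content": user_text})
        reply = call_model(history)          # model sees the whole history
        history.append({"role": "assistant", "content": reply})
        return reply

    return ask

# Stub "model" that just reports how many messages it was given:
ask = make_chat(lambda msgs: f"seen {len(msgs)} messages")
print(ask("first prompt"))   # seen 1 messages
print(ask("second prompt"))  # seen 3 messages (history was resent)
```

Restarting the model after hitting the context limit is a separate concern; even without restarting, dropping or summarizing the oldest entries in `history` is the usual way to stay under the window.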
0 votes · 0 answers · 50 views
Custom NER to extract header, request and response from API document
I'm trying to extract API integration parameters like Authorization headers, query params, and request body fields from API documentation. This is essentially a custom NER task.
I’ve experimented with ...
0 votes · 1 answer · 149 views
LLM-Agent: Tool calling problem after conversion from HuggingFace to Ollama for llama stack
I am using llama stack (https://llama-stack.readthedocs.io/en/latest/) with Ollama as the model provider.
At first I used tool calling from models directly downloaded from Ollama. ...
0 votes · 0 answers · 99 views
How to implement context-aware tool routing with local models like Ollama?
I'm using a locally hosted model (llama3.2) with Ollama and trying to replicate functionality similar to LangChain's bind_tools (creating and running tools with the LLM) for tool calling.
This is my model service
...
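For readers landing here: bind_tools can be approximated by hand with a registry of callables plus a dispatcher over the tool calls the model emits. The tool-call shape below (`{"function": {"name": ..., "arguments": {...}}}`) mirrors what Ollama's chat API returns as I understand it, but treat that as an assumption to verify against your client version:

```python
# Hand-rolled substitute for bind_tools: register callables, then dispatch
# on the tool calls the model emits. The tool-call dict shape is assumed to
# mirror Ollama's chat response format.

TOOLS = {}

def tool(fn):
    """Register a function under its own name so the model can call it."""
    TOOLS[fn.__name__] = fn
    return fn

@tool
def get_weather(city: str) -> str:
    return f"sunny in {city}"  # stand-in for a real lookup

def run_tool_calls(tool_calls):
    results = []
    for call in tool_calls:
        name = call["function"]["name"]
        args = call["function"]["arguments"]
        if name not in TOOLS:
            results.append(f"unknown tool: {name}")
            continue
        results.append(TOOLS[name](**args))
    return results

fake_calls = [{"function": {"name": "get_weather",
                            "arguments": {"city": "Oslo"}}}]
print(run_tool_calls(fake_calls))  # ['sunny in Oslo']
```

The "context-aware routing" part then reduces to deciding which subset of `TOOLS` to advertise to the model on each request, based on the conversation so far.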
1 vote · 0 answers · 239 views
Multi MCP Tool Servers Issue with llama-3-3-70b-instruct
I'm following the code from these links:
https://github.com/jalr4ever/Tiny-OAI-MCP-Agent/blob/main/mcp_client.py
https://github.com/philschmid/mcp-openai-gemini-llama-example/blob/master/...
0 votes · 1 answer · 135 views
WASM LlamaEdge won't use GPU: fix the problem or change tools?
So I'm trying to toss together a little demo that is essentially: 1) generate some text live and save to a file (I've got this working), 2) have a local instance of an LLM running (Llama3 in this case)...
0 votes · 0 answers · 596 views
Passing the correct context to the model via the Ollama API
I am teaching myself LLM programming by developing a RAG application. I am running Llama 3.2 on my laptop using Ollama, with a mix of SQLite and LangChain.
I can pass a context to the LLM along ...
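Worth noting for this kind of RAG question: the Ollama API keeps no state between calls, so "passing context" usually just means inlining the retrieved chunks into the prompt (or a system message) on every request. A minimal sketch, with the prompt wording being my own illustrative choice:

```python
# Minimal RAG prompt assembly: inline retrieved chunks into the prompt on
# every request, since /api/generate itself is stateless. The template text
# here is illustrative, not canonical.

def build_rag_prompt(question, chunks):
    context = "\n\n".join(f"[{i+1}] {c}" for i, c in enumerate(chunks))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}\nAnswer:"
    )

prompt = build_rag_prompt(
    "When was the warehouse opened?",
    ["The warehouse opened in 1998.", "It was expanded in 2005."],
)
# then send as {"model": "llama3.2", "prompt": prompt} to /api/generate
```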
0 votes · 0 answers · 30 views
Encountering a problem while fine-tuning Llama 3.1 on a custom dataset with LoRA
I am learning to fine-tune Llama 3.1 on a custom dataset. I have converted my dataset to a Hugging Face dataset. Evaluating directly with the model gives an accuracy of 80%. Now when I am trying to fine ...
0 votes · 0 answers · 350 views
Repetition Issues in Llama Models (3:8B, 3:70B, 3.1, 3.2)
I'm extracting Inputs, Outputs, and Summaries from large legacy codebases (COBOL, RPG), but facing repetition issues, especially when generating bullet points. Summaries work fine, but sections like ...
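For repetition issues like this, Ollama exposes sampling options that are often more effective than prompt tweaks. `repeat_penalty` and `repeat_last_n` are real Ollama option names; the values below are illustrative starting points, not recommendations:

```python
# Sampling options that commonly tame repetition in Ollama-served Llama
# models. Values are illustrative starting points to tune, not a recipe.

options = {
    "repeat_penalty": 1.2,   # penalize tokens generated recently
    "repeat_last_n": 256,    # how far back the penalty window looks
    "temperature": 0.3,      # lower temperature also reduces rambling
}
# e.g. ollama.generate(model="llama3:8b", prompt=..., options=options)
```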
0 votes · 1 answer · 136 views
llama3.1 - Results from tool ignored
I am communicating with Ollama (llama3.1b) and have it respond with a tool call that I can resolve. However, I am struggling with the final call to Ollama that would resolve the original question. I ...
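The usual fix for "results from tool ignored" is message plumbing: after resolving the tool call, append the result as a role-"tool" message and send the whole conversation back for a second chat call. The field names below follow Ollama's chat format as I understand it; verify them against your client version:

```python
# Feed a resolved tool result back to the model: keep the assistant turn
# that contained the tool call, then append the result under role "tool".
# Message field names are assumptions based on Ollama's chat format.

def append_tool_result(messages, assistant_msg, tool_name, result):
    messages = messages + [assistant_msg]   # keep the tool-calling turn
    messages.append({"role": "tool", "tool_name": tool_name,
                     "content": str(result)})
    return messages

msgs = [{"role": "user", "content": "What is 2+2?"}]
assistant = {"role": "assistant", "content": "",
             "tool_calls": [{"function": {"name": "add",
                                          "arguments": {"a": 2, "b": 2}}}]}
msgs = append_tool_result(msgs, assistant, "add", 4)
# second call: ollama.chat(model="llama3.1", messages=msgs) -> final answer
```

If the second call omits either the assistant turn or the tool message, the model has no way to connect the result to the original question, which matches the symptom described.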
1 vote · 1 answer · 454 views
Unable to get llama3 to serve a JSON response on a local Ollama installation using Jupyter notebook
On a Windows 11 machine, I am trying to get a JSON response from the llama3 model on my local Ollama installation in a Jupyter notebook, but it does not work.
Steps I tried:
This below snippet works
...
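One detail that often resolves this class of question: Ollama supports constrained JSON output via the `format` field on the request body; asking for JSON in the prompt alone is frequently not enough. A sketch of the payload for `/api/generate` (the prompt text is illustrative):

```python
# Request JSON-constrained output from Ollama via the "format" field on
# /api/generate. Prompt text is illustrative; "format" and "stream" are
# real fields of the Ollama HTTP API.
import json

payload = {
    "model": "llama3",
    "prompt": "List three colors as JSON with key 'colors'.",
    "format": "json",   # constrains the response body to valid JSON
    "stream": False,    # return one complete response object
}
body = json.dumps(payload)
# requests.post("http://localhost:11434/api/generate", data=body)
# then json.loads(response.json()["response"]) should parse cleanly
```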
0 votes · 1 answer · 226 views
llama3 responding with only function calls?
I am trying to make Llama3 Instruct use function calls from tools. It does work, but now it is answering only with function calls! If I ask something like "who are you?" or "what is an Apple device?" it ...
1 vote · 0 answers · 3k views
How can I accurately count tokens for Llama3/DeepSeek r1 prompts when Groq API reports "Request too large"?
I'm integrating the Groq API in my Flask application to classify social media posts using a model based on DeepSeek r1 (e.g., deepseek-r1-distill-llama-70b). I build a prompt by combining multiple ...
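Without the model's exact tokenizer, client-side counting can only be an estimate; a rough rule of thumb for Llama-family tokenizers is about 4 characters per token (an approximation, not the real BPE). A hedged sketch of budgeting a prompt before sending it, with the limit and safety margin being illustrative values:

```python
# Rough client-side token budgeting before hitting the Groq API. The
# 4-chars-per-token ratio is a heuristic approximation of Llama-family
# tokenizers, and the 6000-token limit here is illustrative.

def estimate_tokens(text, chars_per_token=4):
    return max(1, len(text) // chars_per_token)

def fits_budget(prompt, limit=6000, safety=0.9):
    # leave headroom, since the heuristic undercounts for code and non-English
    return estimate_tokens(prompt) <= int(limit * safety)

print(estimate_tokens("a" * 8000))  # 2000
print(fits_budget("a" * 8000))      # True  (2000 <= 5400)
print(fits_budget("a" * 30000))     # False (7500 >  5400)
```

For exact counts, loading the model's own tokenizer (e.g. via `transformers.AutoTokenizer`) is the reliable route; the heuristic above is only for cheap pre-flight checks.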
0 votes · 0 answers · 143 views
How does the batch option work in the transformers pipeline?
I have a collection of news articles and I want to produce some new (unbiased) news articles using meta-llama/Meta-Llama-3-8B-Instruct. The articles are in a Hugging Face Dataset, and to feed the ...