Adding parameter support for Local LLMs and GROQ #808
Conversation
Thanks for this @gpapp! We have an experimental groq branch, which uses their official SDK instead of making manual requests (which is what we'd like to do in the future with the default OpenAI-compatible client function). Do you think you could adapt your changes to work on that? This would also make them less risky (in terms of how different people have set up environments, which may inadvertently cause problems with extra env flags), as it would only apply to Groq for now. Btw, the prompt mismatch is a good catch; I was convinced that was a shortcoming of Mixtral.
Added more aggressive file content parsing to CodeMonkey.py and more aggressive response parsing to function_calling.py.
Moved return type enforcement to function_calling; this should work with OpenAI as well.
Merged the groq branch from @senko with my own implementation.
Updated to use the SDK (which in fact uses the OpenAI-compatible API for now) and re-added all the modifiers that can be passed as parameters to Groq.
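The parameter pass-through described above could look roughly like the following sketch: optional sampling parameters are read from environment variables, cast to their numeric types, and collected into a kwargs dict that would then be forwarded to the Groq SDK's `client.chat.completions.create(**params)` call. The variable names (`GROQ_TEMPERATURE`, etc.) and the helper itself are hypothetical, not the PR's actual code.

```python
import os

# Hypothetical mapping of env flags to Groq chat-completion parameters.
# Only variables that are actually set end up in the request, so users
# who configure nothing get the SDK defaults.
GROQ_PARAM_TYPES = {
    "GROQ_TEMPERATURE": ("temperature", float),
    "GROQ_TOP_P": ("top_p", float),
    "GROQ_MAX_TOKENS": ("max_tokens", int),
}

def groq_params_from_env(env=None):
    """Build a kwargs dict of optional Groq parameters from env vars."""
    if env is None:
        env = os.environ
    params = {}
    for var, (name, cast) in GROQ_PARAM_TYPES.items():
        raw = env.get(var)
        if raw is not None:
            # Invalid values raise immediately, which is preferable to
            # silently sending a malformed request to the API.
            params[name] = cast(raw)
    return params
```

Scoping the lookup to explicitly Groq-prefixed variables is one way to address the maintainer's concern: stray env flags from other setups would not leak into the request.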
Worked for me on macOS, Python 3.11.
Some locally served LLMs need more fine-tuning of their parameters than the OpenAI API allows.
Groq has also opened its API, which accepts a limited set of parameters that can be passed along.
The example prompt contains the timeout value quoted as a string, while a numeric value is expected; resolving this mismatch wastes many tokens.
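The quoted-timeout problem can be handled with return type enforcement on the parsed function-call arguments: coerce string values back to the types the schema declares instead of burning tokens asking the model to fix them. A minimal sketch, assuming arguments arrive as a dict and the schema is a JSON-schema `properties` block (the helper name is illustrative, not the PR's actual function):

```python
def coerce_types(args: dict, properties: dict) -> dict:
    """Coerce string-valued arguments to the types declared in a
    JSON-schema 'properties' block, e.g. a timeout the model quoted."""
    casts = {
        "integer": int,
        "number": float,
        "boolean": lambda v: v.lower() == "true",
    }
    out = {}
    for key, value in args.items():
        expected = properties.get(key, {}).get("type")
        cast = casts.get(expected)
        if cast is not None and isinstance(value, str):
            out[key] = cast(value)  # "30" -> 30 for an integer field
        else:
            out[key] = value
    return out
```

This works the same against OpenAI's function calling, which matches the commit note that moving the enforcement into function_calling keeps it backend-agnostic.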