docs: Added Deploying LLMs into production + a new ecosystem #4047
Conversation
Signed-off-by: Kourosh Hakhamaneshi <[email protected]>
One possible extension to this PR:
Ah, this is a good idea. Conventionally, inference services are hosted by LLM providers. I had been thinking of LangChain as part of the product logic, but after reading your PR I realized we are a valuable inference host too, especially after so much customization through multi-API composition, (prefix) prompt engineering, and complex agent features. Thank you for sharing your great ideas!
… things
Signed-off-by: Kourosh Hakhamaneshi <[email protected]>
Signed-off-by: Kourosh Hakhamaneshi <[email protected]>
hwchase17 left a comment
This is more of a Ray integration than a comprehensive guide to deploying LLMs in production. It seems more suitable for the ecosystem section.
Hey @hwchase17, thanks for your input. I'm considering revising the section on "Deployment of LLMs in production." Instead of the current story about how Ray Serve can assist with deployment, I'm thinking of creating a more general tutorial on the key concepts to consider when deploying LLMs (autoscaling, spot-instance serving, defining endpoints, etc.). Then I can link it to the Ray integration page for follow-ups. What do you think of this approach? Do you have any other suggestions or ideas to improve this section?
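For concreteness, here is a minimal sketch (not code from this PR) of the kind of deployment such a guide would discuss: a LangChain chain served behind an autoscaling HTTP endpoint with Ray Serve. The chain, route, and replica counts are hypothetical; it assumes `ray[serve]`, `fastapi`, and `langchain` are installed and `OPENAI_API_KEY` is set.

```python
from fastapi import FastAPI
from ray import serve
from langchain.chains import LLMChain
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate

app = FastAPI()

@serve.deployment(
    # Ray Serve scales replicas up and down based on ongoing request load;
    # the bounds here are illustrative.
    autoscaling_config={"min_replicas": 1, "max_replicas": 4},
)
@serve.ingress(app)
class ChainDeployment:
    def __init__(self):
        prompt = PromptTemplate(
            input_variables=["topic"],
            template="Write a one-sentence summary about {topic}.",
        )
        # Requires OPENAI_API_KEY in the environment.
        self.chain = LLMChain(llm=OpenAI(), prompt=prompt)

    @app.post("/generate")
    def generate(self, topic: str) -> str:
        # Each HTTP request runs the chain on one replica.
        return self.chain.run(topic)

# Deploy locally; in production this would run on a Ray cluster.
serve.run(ChainDeployment.bind())
```

Spot-instance serving would then be a matter of the cluster's node configuration rather than the deployment code, which is part of why the concepts deserve their own general write-up.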
Hey there - also from the Ray team here -- your feedback makes sense, @hwchase17! It's probably better for this PR to actually start the "comprehensive guide for deploying LLMs in production". We could use both the Ray and the BentoML (https://github.com/ssheng/BentoChain) examples as starting points, so that it doesn't just look like a basic Ray integration. The sections of the guide would include the parts that @kouroshHakha mentioned. Thoughts?
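To show the other starting point, here is a minimal sketch in the spirit of the BentoChain example linked above (not its actual code). It assumes BentoML 1.x and `langchain` are installed; the service name and API are hypothetical.

```python
import bentoml
from bentoml.io import Text
from langchain.chains import LLMChain
from langchain.llms import OpenAI
from langchain.prompts import PromptTemplate

prompt = PromptTemplate(
    input_variables=["topic"],
    template="Write a one-sentence summary about {topic}.",
)
chain = LLMChain(llm=OpenAI(), prompt=prompt)

svc = bentoml.Service("langchain-demo")

@svc.api(input=Text(), output=Text())
def generate(topic: str) -> str:
    # `bentoml serve service.py:svc` exposes this function as an HTTP endpoint.
    return chain.run(topic)
```

Having both frameworks express the same chain as an endpoint would make the guide read as framework-agnostic concepts plus interchangeable solution starters.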
I think it makes sense. We can keep the comprehensive guide for deploying LLMs in production focused more on conceptual understanding and provide solution starters. From there we can link to more detailed examples.
Signed-off-by: Kourosh Hakhamaneshi <[email protected]>
Hey @hwchase17, I updated the main section to be more general and more along the lines of a comprehensive guide for deploying LLMs in production. I'm thinking of linking different serving ecosystems (Ray Serve, BentoChain, etc.) to this main doc, and it should live somewhere highly visible in the main doc pages (maybe under LLMs?). I can polish the text/figures a bit more if the outline sounds reasonable to you. Thanks.
…edbacks
Signed-off-by: Kourosh Hakhamaneshi <[email protected]>
hwchase17 left a comment
The Ray Serve notebook looks good!
The Deploying LLMs notebook is also good in content, but I think it's in the wrong place. "Modules" is very specific to code in the library, while this is (very good) general-purpose documentation.
I would suggest we move this to the additional resources section (and maybe make it a markdown file; there's no reason for it to be an ipynb).
Signed-off-by: Kourosh Hakhamaneshi <[email protected]>
Hi @hwchase17, thanks for the suggestions. @kouroshHakha implemented the requested changes. Please have a look.
Co-authored-by: Kamil Kaczmarek <[email protected]>
@hwchase17 I think this PR is ready to be merged. Can someone from your team do a final pass? Thanks.
hwchase17 left a comment
lgtm - thanks! sorry for the delay
…in-ai#4047)
Signed-off-by: Kourosh Hakhamaneshi <[email protected]>
Co-authored-by: Kamil Kaczmarek <[email protected]>
Co-authored-by: Harrison Chase <[email protected]>