-
Notifications
You must be signed in to change notification settings - Fork 277
Update TGI image versions #1749
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com>
Dependency Review✅ No vulnerabilities or license issues found.Scanned FilesNone |
There are conflicts. |
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com>
There are new conflicts. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
This PR updates the TGI image versions across multiple Compose files for various platforms. In particular, it upgrades the Intel CPU images to version 2.4.1, the AMD/GPU (ROCm) images to version 2.4.1, and the Gaudi image to version 2.3.1.
- Upgrade Intel CPU images for ChatQnA, AvatarChatbot, and AudioQnA from 2.4.0 to 2.4.1
- Upgrade AMD/GPU (ROCm) images in ChatQnA, AvatarChatbot, AudioQnA, and AgentQnA from versions 2.3.1 (and in one case 3.0.0) to 2.4.1
- Update documentation in corresponding README.md files to reflect the new image tags
Reviewed Changes
Copilot reviewed 56 out of 61 changed files in this pull request and generated no comments.
Show a summary per file
File | Description |
---|---|
ChatQnA/docker_compose/intel/cpu/xeon/compose_tgi.yaml | Updated Intel CPU image version for TGI service |
ChatQnA/docker_compose/intel/cpu/xeon/compose_faqgen_tgi.yaml | Updated Intel CPU image version for TGI FAQ service |
ChatQnA/docker_compose/intel/cpu/xeon/README.md | Updated documentation with new TGI CPU image version |
ChatQnA/docker_compose/amd/gpu/rocm/*.yaml & README.md | Updated AMD/GPU (ROCm) image tags to version 2.4.1 |
AvatarChatbot/docker_compose/intel/hpu/gaudi/*.yaml & README.md | Updated Gaudi image to version 2.3.1 |
AvatarChatbot/docker_compose/intel/cpu/xeon/*.yaml & README.md | Updated Intel CPU image to version 2.4.1 |
AvatarChatbot/docker_compose/amd/gpu/rocm/compose.yaml | Updated AMD/GPU (ROCm) image from 2.3.1 to 2.4.1 |
AudioQnA/docker_compose/intel/cpu/xeon/compose_tgi.yaml | Updated Intel CPU image to version 2.4.1 |
AudioQnA/docker_compose/amd/gpu/rocm/README.md | Updated AMD/GPU (ROCm) image tag to version 2.4.1 |
AgentQnA/docker_compose/amd/gpu/rocm/compose.yaml | Updated image version from 3.0.0-rocm to 2.4.1-rocm |
AgentQnA/docker_compose/amd/gpu/rocm/README.md | Updated documentation with new AMD/GPU (ROCm) image tag |
Files not reviewed (5)
- AudioQnA/tests/test_compose_tgi_on_gaudi.sh: Language not supported
- AudioQnA/tests/test_compose_tgi_on_xeon.sh: Language not supported
- AvatarChatbot/tests/test_compose_on_gaudi.sh: Language not supported
- AvatarChatbot/tests/test_compose_on_rocm.sh: Language not supported
- AvatarChatbot/tests/test_compose_on_xeon.sh: Language not supported
Comments suppressed due to low confidence (1)
AgentQnA/docker_compose/amd/gpu/rocm/compose.yaml:4
- The image tag is downgraded from 3.0.0-rocm to 2.4.1-rocm. Please confirm if this downgrade is intentional or if the intended version should be maintained.
image: ghcr.io/huggingface/text-generation-inference:3.0.0-rocm
TGI Gaudi update to 2.3.1 log from whiper load failed.
|
The whisper-gaudi issue is being tracked by opea-project/GenAIComps#1608 |
Description
Versions in AvatarChatbot and ChatQnA-ROCm are not updated due to known issues.
Issues
Related to #1563
Type of change
List the type of change like below. Please delete options that are not relevant.
Dependencies
N/A
Tests
git grep /tgi-gaudi: | grep -v 2.3.1