ZePan110
8a9f3f4351
Organize set_env.sh paths and update README.md (#1920)
...
Signed-off-by: ZePan110 <ze.pan@intel.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
Co-authored-by: Ying Hu <ying.hu@intel.com>
2025-05-20 10:05:00 +08:00
Daniel De León
3fb59a9769
Update DocSum README and environment configuration (#1917)
...
Signed-off-by: Daniel Deleon <daniel.de.leon@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Abolfazl Shahbazi <12436063+ashahba@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
Co-authored-by: Eero Tamminen <eero.t.tamminen@intel.com>
Co-authored-by: Zhenzhong Xu <zhenzhong.xu@intel.com>
2025-05-15 11:58:58 -07:00
Eero Tamminen
4efb1e0833
Update paths to GenAIInfra scripts (#1923)
...
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2025-05-10 21:57:52 +08:00
Sun, Xuehao
b467a13ec3
daily update vLLM&vLLM-fork version (#1914)
...
Signed-off-by: Sun, Xuehao <xuehao.sun@intel.com>
2025-05-08 10:34:36 +08:00
Melanie Hart Buehler
7bb05585b6
Move file processing from UI to DocSum backend service (#1899)
...
Signed-off-by: Melanie Buehler <melanie.h.buehler@intel.com>
2025-05-08 09:05:30 +08:00
ZePan110
99b62ae49e
Integrate DocSum set_env to ut scripts. (#1860)
...
Integrate DocSum set_env to ut scripts.
Add README.md for DocSum and InstructionTuning UT scripts.
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-04-28 13:35:05 +08:00
chen, suyue
be5933ad85
Update benchmark scripts (#1883)
...
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-04-25 17:05:48 +08:00
chyundunovDatamonsters
ef9290f245
DocSum - refactoring README.md for deploy application on ROCm (#1881)
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
2025-04-25 13:36:40 +08:00
chyundunovDatamonsters
3b0bcb80a8
DocSum - Adding files to deploy an application in the K8S environment using Helm (#1758)
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
Signed-off-by: Chingis Yundunov <c.yundunov@datamonsters.com>
Co-authored-by: Chingis Yundunov <YundunovCN@sibedge.com>
Co-authored-by: Artem Astafev <a.astafev@datamonsters.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
2025-04-25 13:33:08 +08:00
vrantala
29d449b3ca
Added Initial version of DocSum support for benchmarking scripts for OPEA (#1840)
...
Signed-off-by: Valtteri Rantala <valtteri.rantala@intel.com>
Co-authored-by: Liang Lv <liang1.lv@intel.com>
Co-authored-by: ZePan110 <ze.pan@intel.com>
2025-04-21 10:32:28 +08:00
XinyaoWa
c7f06d5e54
Refine documents for DocSum (#1802)
...
Signed-off-by: Xinyao <xinyao.wang@intel.com>
2025-04-20 16:20:20 +08:00
mahathis
c73b09a758
Update AgentQnA and DocSum for Gaudi Compatibility (#1777)
...
Signed-off-by: Mahathi Vatsal <mahathi.vatsal.salopanthula@intel.com>
2025-04-15 22:01:27 -07:00
Liang Lv
13dd27e6d5
Update vLLM parameter max-seq-len-to-capture (#1809)
...
Signed-off-by: lvliang-intel <liang1.lv@intel.com>
2025-04-15 14:27:12 +08:00
XinyaoWa
063547fb66
Align DocSum env to vllm (#1784)
...
Signed-off-by: sys-lpot-val <sys_lpot_val@intel.com>
Co-authored-by: sys-lpot-val <sys_lpot_val@intel.com>
2025-04-10 11:38:24 +08:00
ZePan110
00d7a65dd8
Enable model cache for Rocm docker compose test. (#1614)
...
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-04-10 09:40:37 +08:00
ZePan110
5f4b3a6d12
Adaptation to vllm v0.8.3 build paths (#1761)
...
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-04-09 13:20:02 +08:00
dolpher
46ebb78aa3
Sync values yaml file for 1.3 release (#1748)
...
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-04-08 22:39:40 +08:00
Spycsh
d4952d1e7c
Refine third parties links (#1764)
...
Signed-off-by: Spycsh <sihan.chen@intel.com>
2025-04-08 18:39:13 +08:00
ZePan110
42735d0d7d
Fix vllm and vllm-fork tags (#1766)
...
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-04-07 22:58:50 +08:00
chyundunovDatamonsters
319dbdaa6b
Adding files to deploy DocSum application on ROCm vLLM (#1572)
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
2025-04-03 14:20:23 +08:00
Xiaotian Chen
1bd56af994
Update TGI image versions (#1625)
...
Signed-off-by: xiaotia3 <xiaotian.chen@intel.com>
2025-04-01 11:27:51 +08:00
Letong Han
d4dcbd18ef
Enable vllm for DocSum (#1716)
...
Set vllm as default llm serving, and add related docker compose files, readmes, and test scripts.
Fix issue #1436
Signed-off-by: letonghan <letong.han@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-03-28 17:15:01 +08:00
chen, suyue
2204fe8e36
Enable base image build in CI/CD (#1669)
...
Signed-off-by: chensuyue <suyue.chen@intel.com>
2025-03-19 09:21:51 +08:00
Eero Tamminen
4269669f73
Use GenAIComp base image to simplify Dockerfiles & reduce image sizes - part 2 (#1638)
...
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2025-03-13 08:23:07 +08:00
ZePan110
5aecea8e47
Update compose.yaml (#1619)
...
Update compose.yaml for CodeGen, CodeTrans and DocSum
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-03-07 09:20:28 +08:00
ZePan110
c1b5ba281f
Enable CodeGen, CodeTrans and DocSum model cache for docker compose test. (#1599)
...
1. Add cache path check.
2. Enable CodeGen, CodeTrans and DocSum model cache for docker compose test.
Signed-off-by: ZePan110 <ze.pan@intel.com>
2025-03-04 16:10:20 +08:00
dependabot[bot]
d46df4331d
Bump gradio from 5.5.0 to 5.11.0 in /DocSum/ui/gradio (#1576)
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
Co-authored-by: Liang Lv <liang1.lv@intel.com>
2025-02-25 14:32:03 +08:00
WenjiaoYue
abafd5de20
Update UI of the three demos: faqGen, VisualQnA, and DocSum. (#1528)
...
Signed-off-by: WenjiaoYue <wenjiao.yue@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-12 15:57:51 +08:00
chen, suyue
81b02bb947
Revert "HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#… (#1521)
...
Revert this PR since the test is not triggered properly due to the false merge of a WIP CI PR, 44a689b0bf, which blocks the CI test.
This change will be submitted in another PR.
2025-02-11 18:36:12 +08:00
xiguiw
45d5da2ddd
HUGGINGFACEHUB_API_TOKEN environment is change to HF_TOKEN (#1503)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2025-02-09 20:33:06 +08:00
xiguiw
1b3291a1c8
Fix docker compose.yaml error (#1496)
...
Signed-off-by: Wang, Xigui <xigui.wang@intel.com>
2025-02-07 09:53:20 +08:00
Omar Khleif
32d4f714fd
Fix for NLTK related import failure (#1487)
...
Signed-off-by: okhleif-IL <omar.khleif@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-02-01 10:04:37 +08:00
dolpher
ee0e5cc8d9
Sync value files from GenAIInfra (#1428)
...
All gaudi values updated with extra flags.
Added helm support for 2 new examples Text2Image and SearchQnA. Minor fix for llm-uservice.
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-01-22 17:44:11 +08:00
WenjiaoYue
b721c256f9
Fix Domain Access Issue in Latest Vite Version (#1444)
...
Fix the restriction on using domain names when users are using the latest version of Vite
When users use the new version of Vite, the UI cannot be accessed via domain names due to Vite's new rules. This fix adds the corresponding parameters according to Vite's new rules, ensuring that users can access the frontend via domain names when building the UI.
Fixes #1441
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com>
2025-01-21 23:28:37 +08:00
chen, suyue
927698e23e
Simplify git clone code in CI test (#1434)
...
1. Simplify git clone code in CI test.
2. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com>
2025-01-21 23:00:08 +08:00
chen, suyue
6bfd156573
Clean up test scripts and enhance git clone (#1417)
...
1. Clean up test code in scripts.
2. Simplify git clone code.
3. Replace git clone branch in Dockerfile.
Signed-off-by: chensuyue <suyue.chen@intel.com>
2025-01-20 16:34:28 +08:00
XinyaoWa
39409d7f61
Align OpenAI API for FaqGen, DocSum (#1401)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-17 11:19:35 +08:00
XinyaoWa
71e3c57366
Standardize name for LLM comps (#1402)
...
Update all the names for classes and files in llm comps to follow the standard format. Related GenAIComps PR: opea-project/GenAIComps#1162
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-16 23:10:27 +08:00
Eero Tamminen
0eae391fda
Use staged builds to minimize final image sizes (#1031)
...
Staged image builds so that final images do not have redundant things like:
- Git tool and its deps
- Git repo history
- Test directories
Fixes: #225
Signed-off-by: Eero Tamminen <eero.t.tamminen@intel.com>
2025-01-16 11:14:47 +08:00
XinyaoWa
ff1310b11a
Refactor docsum (#1336)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-13 15:49:48 +08:00
dolpher
c795ef2203
Add helm deployment instructions for GenAIExamples (#1373)
...
Add helm deployment instructions for ChatQnA, AgentQnA, AudioQnA, CodeTrans, DocSum, FaqGen and VisualQnA
Signed-off-by: Dolpher Du <dolpher.du@intel.com>
2025-01-10 09:55:31 +08:00
Jaswanth Karani
ddacb7e86d
fixed build issue (#1367)
2025-01-08 22:19:23 +08:00
XinyaoWa
464e2d3125
Rename streaming to stream to align with OpenAI API (#1332)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
2025-01-06 13:25:55 +08:00
Sihan Chen
cc1d97f816
Refactor AudioQnA/MultiModalQnA/AvatarChatbot (#1310)
...
Signed-off-by: chensuyue <suyue.chen@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: chensuyue <suyue.chen@intel.com>
2024-12-31 12:47:30 +08:00
Sihan Chen
a01729a5c2
Refactor DocSum example (#1286)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-26 14:45:17 +08:00
XinyaoWa
50dd959d60
Support Long context for DocSum (#1255)
...
Signed-off-by: Xinyao Wang <xinyao.wang@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: lkk <33276950+lkk12014402@users.noreply.github.com>
2024-12-20 19:17:10 +08:00
Mustafa
84a6a6e9bc
Adding URL summary option to DocSum Gradio-UI (#1248)
...
Signed-off-by: okhleif-IL <omar.khleif@intel.com>
Co-authored-by: okhleif-IL <omar.khleif@intel.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: lkk <33276950+lkk12014402@users.noreply.github.com>
Co-authored-by: chen, suyue <suyue.chen@intel.com>
Co-authored-by: WenjiaoYue <wenjiao.yue@intel.com>
2024-12-19 10:49:03 +08:00
chyundunovDatamonsters
67634dfd22
DocSum - Solving the problem of running DocSum on ROCm (#1268)
...
Signed-off-by: Chingis Yundunov <YundunovCN@sibedge.com>
2024-12-18 17:38:38 +08:00
lkk
2af1ea0f8e
remove examples gateway. (#1243)
...
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-12-13 15:16:11 +08:00
Omar Khleif
00b526c8e5
Changed Default UI to Gradio (#1246)
...
Signed-off-by: okhleif-IL <omar.khleif@intel.com>
2024-12-11 11:04:10 -08:00