LLM multiple modal applications

lightsong發表於2024-09-17

MoneyPrinterTurbo

https://github.com/harry0703/MoneyPrinterTurbo/tree/main

利用AI大模型,一鍵生成高畫質短影片 Generate short videos with one click using AI LLM.

FunClip

https://github.com/modelscope/FunClip

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

h2oGPT

https://github.com/h2oai/h2ogpt

Turn ★ into ⭐ (top-right corner) if you like the project!

Query and summarize your documents or just chat with local private GPT LLMs using h2oGPT, an Apache V2 open-source project.

LLM-Video-Sense

https://github.com/freeline55/LLM-Video-Sense

本專案在實現上將大語音模型大語言模型進行了有效整合,實現了兩個模型的介面組合,為語言模型新增了影片處理能力。總體架構圖如下:

專案總體架構圖

StreamRAG

https://github.com/video-db/StreamRAG

Video Search and Streaming Agent

What does it do? 🤔

It enables developers to:

  • 📚 Upload multiple videos to create a library or collection.
  • 🔍 Search across these videos and get real-time video responses or compilations.
  • 🛒 Publish your searchable collection on the ChatGPT store.
  • 📝 Receive summarized text answers (RAG).
  • 🌟 Gain key insights from specific videos (e.g. "Top points from episode 31").

LLM-Minutes-of-Meeting

https://github.com/inboxpraveen/LLM-Minutes-of-Meeting

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where we'll be open for contributions to enable real-time meeting transcription! 🚀

https://github.com/AIAnytime/YouTube-Video-Summarization-App/tree/main

https://github.com/sokunheng/DownEdit

相關文章