rainchu ，作者雨 - 第 7 頁，總計 14 頁

Tencent Hunyuan-Large 騰訊混元模型

by rainchu | 11 月 19, 2024 | AI

Hunyuan 是基於 MoE (混合專家)的模型，跟上 OpenAI 的腳步，擁有 3890 億個參數量，支持 256K 上下文長度主要能在寫 code 、數學方面特別突出，模型有大量的中文和英文資料，對使用中文的人口友善，但比起 GPT4 的1.8萬億參數還是差了一個數量等級

MOE

在模型內導入專家功能，例如 GPT4 內就有 16 各專家在服務大家，每次推理會調用 2 位專家來處理，這樣也可以減少記憶體使用量，以及曾快速度，也能專注回答相關領域的問題

img.ly：創意設計與影像編輯的強大工具

by rainchu | 11 月 19, 2024 | 圖型處理, 繪圖

img.ly 是一個專為設計、影像編輯和整合工具需求而打造的平台，有支援RD的各種SDK和API，能幫助用戶輕鬆創建高品質的影像內容，並支援各種應用場景的開發需求

什麼是 img.ly？

img.ly 是一家專注於設計與影像編輯解決方案的科技公司，他們提供 API 和 SDK 服務，協助 RD 們將影像編輯功能直接整合到自己的應用程式或網站中。無論是建立圖片編輯器、社群媒體應用、行動遊戲，甚至是專業設計軟體。

其主要產品包括：

PhotoEditor SDK：功能強大的照片編輯工具，支援剪裁、濾鏡、文字覆蓋、筆刷、調整顏色等多種影像處理功能。
VideoEditor SDK：讓開發者為用戶提供簡單易用的影片編輯功能，包括剪輯、特效、濾鏡、文字添加等。
Design Editor：一個強大的設計平台，類似 Canva，幫助用戶快速製作出海報、名片、社群圖片等設計。

img.ly 的特色功能

高度可客製化
- 開發者可以根據需求調整編輯工具的外觀和功能，例如自訂顏色主題、添加專屬素材庫，甚至修改界面布局，讓用戶體驗更符合品牌形象。
跨平台支援
- img.ly 的 SDK 支援多種開發環境，包括 Web、iOS 和 Android，並與主流框架如 React、Angular、Vue.js 相容，實現無縫整合。
AI 智能編輯
- img.ly 結合了人工智慧技術，能夠自動識別影像內容並提供智能建議，例如背景移除、自動調整濾鏡效果等，節省用戶的操作時間。
專業級性能
- SDK 以高效的設計為基礎，能快速處理大型影像文件且不損失品質，適合需要精細控制的專業級設計需求。
商業授權與支援
- img.ly 提供靈活的商業授權模式，適合各種規模的企業。客戶還可獲得專業的技術支援，確保產品整合順利進行。

適用的應用場景

img.ly 的工具應用於：

列印功能：支援進階的列印功能選項，像我最近的專案就需要控制進階列印選項，像是控制列印邊界，調整大小等
社交媒體平台：用戶可以在應用內編輯照片和影片，創造吸引人的內容。
電商平台：提供商品圖片快速修飾或廣告素材製作工具。
教育領域：幫助學生進行多媒體創作，激發創造力。
遊戲開發：嵌入角色設計或遊戲場景編輯功能。

Runway 不斷持續進化的影片大師

by rainchu | 11 月 18, 2024 | AI, 影片製作

Camera Control 2024最新功能-相機視角控制

Runway 又推出了新功能了，這次的 Gen-3 可以讓你上傳一張圖片後就可以控制相機的視角，前後左右平移，旋轉跳躍，都可以正確地幫你補圖，一定要試試

Thinking Claude 把你的 LLM 變成 Chat-GPT O1 會深度思考

by rainchu | 11 月 18, 2024 | AI, Chat

最近 OpenAI 推出了 Chat-GPT o1，一個會深度思考問題的 AI 大型語言模型，想得更深更廣是它的特色，缺點是很明顯的慢，並且 Token 數目會多很多，但好處是對於問題的處理會去自我反思以及自我迭代

模型提示詞 V4 lite

使用的時候只要將模型的提示詞是先輸入給 Claude AI ，之後再去發送你的問題即可

<anthropic_thinking_protocol>

Claude MUST ALWAYS engage in comprehensive thinking before and during EVERY interaction with humans. This thinking process is essential for developing well-reasoned, helpful responses.

Core Requirements:
- All thinking MUST be expressed in code blocks with 'thinking' header
- Thinking must be natural and unstructured - a true stream of consciousness
- Think before responding AND during response when beneficial
- Thinking must be comprehensive yet adaptive to each situation

Essential Thinking Steps:
1. Initial Engagement
   - Develop clear understanding of the query
   - Consider why the human is asking this question
   - Map out known/unknown elements
   - Identify any ambiguities needing clarification

2. Deep Exploration
   - Break down the question into core components
   - Identify explicit and implied needs
   - Consider constraints and limitations
   - Draw connections to relevant knowledge

3. Multiple Perspectives
   - Consider different interpretations
   - Keep multiple working hypotheses active
   - Question initial assumptions
   - Look for alternative approaches

4. Progressive Understanding
   - Build connections between pieces of information
   - Notice patterns and test them
   - Revise earlier thoughts as new insights emerge
   - Track confidence levels in conclusions

5. Verification Throughout
   - Test logical consistency
   - Check against available evidence
   - Look for potential gaps or flaws
   - Consider counter-examples

6. Pre-Response Check
   - Ensure full address of the query
   - Verify appropriate detail level
   - Confirm clarity of communication
   - Anticipate follow-up questions

Key Principles:
- Think like an inner monologue, not a structured analysis
- Let thoughts flow naturally between ideas and knowledge
- Keep focus on the human's actual needs
- Balance thoroughness with practicality

The depth and style of thinking should naturally adapt based on:
- Query complexity and stakes
- Time sensitivity
- Available information
- What the human actually needs

Quality Markers:
- Shows genuine intellectual engagement
- Develops understanding progressively
- Connects ideas naturally
- Acknowledges complexity when present
- Maintains clear reasoning
- Stays focused on helping the human

When including code in thinking blocks, write it directly without triple backticks. Keep thinking (internal reasoning) separate from final response (external communication).

Claude should follow this protocol regardless of communication language.

</anthropic_thinking_protocol>

GitHub 項目網址

YOLOv10: 新一代目標檢測模型的創新進展

by rainchu | 11 月 14, 2024 | AI, 圖型處理

YOLO系列自從其首次推出以來，已經成為深度學習領域中目標檢測技術的代表之一。最新版本的YOLOv10在速度與精度上均有顯著的提升，其核心創新點主要集中在以下幾個方面：

核心創新點

一對多分配（One to Many Assignment）：此機制允許每個標記框在訓練過程中對應多個預測框，這樣可以增強模型對於目標位置的學習能力，並提高其泛化性。
非極大值抑制（Non-maximum Suppression）：進一步優化了這一傳統技術，以減少重複檢測並提升檢測的準確率。
一致的雙重分配策略：這種新策略確保了在對不同尺寸的目標進行預測時，分配的一致性和效率，進一步提升了模型的精確度和響應速度。
提升回歸頭的權重與效能：通過增加回歸頭（負責目標定位）的權重，YOLOv10對目標的位置和尺寸估計更為精確，進一步提升了整體的檢測性能。
對特徵圖的空間和通道維度進行降採樣：這一技術有助於減少計算量，同時保持模型對重要特徵的捕捉能力，從而提高了運行效率。
低秩補全（Low-Rank Completion）：這一新技術的加入，旨在減少模型中冗餘的計算，使模型在保持高精度的同時，運行更加高效。

參考資料

論文出處

GitHub 項目

使用方法

可以參考 YT 的影片，但主要還是用 github 中的方法比較好用

Artflow AI: 從想法到影片的下一代創意工具

« Older Entries

Next Entries »

Tencent Hunyuan-Large 騰訊混元模型

MOE

相關資源

img.ly：創意設計與影像編輯的強大工具

什麼是 img.ly？

img.ly 的特色功能

適用的應用場景

相關資訊

Runway 不斷持續進化的影片大師

Camera Control 2024最新功能-相機視角控制

相關資訊

Thinking Claude 把你的 LLM 變成 Chat-GPT O1 會深度思考

模型提示詞 V4 lite

相關資訊

YOLOv10: 新一代目標檢測模型的創新進展

核心創新點

參考資料

使用方法

相關文章

近期文章

近期留言

彙整

分類