secondlifeのブックマーク - はてなブックマーク

secondlife id:secondlife

secondlifeのブックマーク (8,235)

Developer Summit 2025 [14-D-1] Yuki Hattori
Bringing Open-Source Brilliance to Scrum Teams: �A Guide to Enhanced Collaboration
secondlife 2025/02/16
リンク
論文まとめ：Competitive Programming with Large Reasoning Models | Shikoan's ML Blog
大規模言語モデル(LLM)を強化学習によって訓練し、競争プログラミングにおける高度な推論とコーディング能力を高めたOpenAIによるLRMの研究。特にo3モデルは、人間によるドメイン特化のテスト戦略を使わずとも、IOIやCodeForcesなどで上位レベルの成績を収める性能を示した。タイトル：Competitive Programming with Large Reasoning Models 著者：OpenAIの方々論文URL：https://arxiv.org/abs/2502.06807 論文まとめ By Gemini この論文「Competitive Programming with Large Reasoning Models」は、大規模言語モデル（LLM）の競争プログラミングにおける性能向上を、強化学習を用いて達成することを目指しています。・この論文において解決したい課題
secondlife 2025/02/14
“論文まとめ：Competitive Programming with Large Reasoning Models Posted On 2025-02-14 こしあん ”
リンク
Optimizing Memory for Bulk Uploads - Qdrant
secondlife 2025/02/14
リンク
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
secondlife 2025/02/14
リンク
SWE-bench
Can Language Models Resolve Real-World GitHub Issues? ICLR 2024 Carlos E. Jimenez*, John Yang*, Alexander Wettig, Shunyu Yao, Kexin Pei, Ofir Press, Karthik Narasimhan News 📣 [02/2025] We've released SWE-agent 1.0. New upgrades include better performance (🥇 OSS on Lite), massive parallelism, and more! [Link] 📣 [10/2024] Introducing SWE-bench Multimodal! Can AI systems "see" bugs and fix them?
secondlife 2025/02/14
リンク
Ollama で Hugging Face Hub の GGUF をそのまま使ってみる｜ぬこぬこ
tl;drOllama で Modelfile を作成することなく、Hugging Face Hub で公開されている GGUF 形式のモデルをそのまま動かせるようになったよやり方は簡単で ollama run hf.co/{ユーザー名}/{リポジトリ名} とするだけ量子化タイプの指定はデフォルト Q4_K_M で好きなタイプも指定可チャットテンプレートはデフォルトは GGUF 内ビルトインテンプレートで自分で指定することも可 Fuck it! You can now run *any* GGUF on the Hugging Face Hub directly with @ollama 🔥 This has been a constant ask from the community, starting today you can point to any of the 45,0
secondlife 2025/02/14
リンク
Qwen2.5とDeepSeek R1を利用した日本語大規模言語モデル「Qwen2.5 Bakeneko 32B」シリーズを公開｜rinna株式会社
ホームニュース Qwen2.5とDeepSeek R1を利用した日本語大規模言語モデル「Qwen2.5 Bakeneko 32B」シリーズを公開 DeepSeek R1を用いた蒸留学習により効率よく日本語思考能力を強化 rinna株式会社 (本社：東京都渋谷区/代表取締役：宋珠憲、以下rinna) は、Qwen2.5とDeepSeek R1を利用し、高い日本語テキストの生成能力を持った「Qwen2.5 Bakeneko 32B」シリーズを開発し、Apache-2.0 Licenseで公開したことを発表します。 ■ rinnaのモデル公開活動これまでにrinnaでは、日本語のタスクに適したGPT・BERT・HuBERT・CLIP・Stable Diffusionなど、テキスト・音声・画像に関する基盤モデルを公開してきました。2021年4月からHugging Faceに公開してきたrin
secondlife 2025/02/14
“Qwen2.5 Bakeneko 32Bをベースにモデルマージと、DeepSeek R1から生成した思考過程を含むテキストデータ1,200件を利用し蒸留学習を行なった”
リンク
Whisk - labs.google/fx
Ready to evolve your Image creation?Sign in and let's see what you can dig up.
secondlife 2025/02/14
リンク
GPTとは何か Transformerの視覚化 | Chapter 5, Deep Learning
この動画は3Blue1Brownの動画を東京大学の学生有志団体が翻訳・再編集し公式ライセンスのもと公開しているものです。チャンネル登録と高評価をよろしくお願いいたします。日本語版Twitter https://twitter.com/3B1BJP 元チャンネル（英語） https://www.youtube.com/c/3blue1brown 元動画（英語） https://youtu.be/wjZofJX0v4M?si=9YsuEzHATlhPtpOF Check out our new channel Ufolium https://www.youtube.com/watch?v=wrNCjIjIzuk&pp=ygUj5aSn57Wx6aCY6YG45oyZ44Gu5LuV57WE44G_IHVmb2xpdW0%3D Richard Turner's introduction
secondlife 2025/02/13
リンク
GitHub - atfortes/Awesome-LLM-Reasoning: Reasoning in LLMs: Papers and Resources, including Chain-of-Thought, OpenAI o1, and DeepSeek-R1 🍓
secondlife 2025/02/13
“Awesome LLM Reasoning ”
リンク
DeepRAG: Thinking to Retrieval Step by Step for Large Language Models
- 3 users
- arxiv.org
- 学び
Large Language Models (LLMs) have shown remarkable potential in reasoning while they still suffer from severe factual hallucinations due to timeliness, accuracy, and coverage of parametric knowledge. Meanwhile, integrating reasoning with retrieval-augmented generation (RAG) rem ains challenging due to ineffective task decomposition and redundant retrieval, which can introduce noise and degrade resp
secondlife 2025/02/12
リンク
AI Engineering
secondlife 2025/02/10
リンク
AI Engineering Landscape
Original list from Developing Apps with GPT-4 and ChatGPT by Olivier Caelen and Marie-Alice Blete (O'Reilly).
secondlife 2025/02/10
リンク
The End of Programming as We Know It
Join the O'Reilly online learning platform. Get a free trial today and find answers on the fly, or master something new and useful. Learn more Betty Jean Jennings and Frances Bilas (right) program the ENIAC in 1946. Via the Computer History Museum Eventually, interpreted languages, which are much easier to debug, became the norm. BASIC, one of the first of these to hit the big time, was at first s
secondlife 2025/02/10
“By Tim O’Reilly ”
リンク
Coolify
Any Language Coolify is compatible with a wide range of programming languages and frameworks, enabling you to launch static websites, APIs, backends, databases, services, and other types of applications. Any Server You can deploy your resources to any server, including your own servers, VPS, Raspberry Pi, EC2, DigitalOcean, Linode, Hetzner, and more. All you need is an SSH connection.
secondlife 2025/02/10
“An open-source & self-hostable Heroku / Netlify / Vercel alternative. ”
リンク
埋め込み単語当てゲーム🧞
本ゲームの解説記事検索機能には static-embedding-japanese を使用しています（モデルのライセンス：MIT License、作者：hotchpotch and contributors）
secondlife 2025/02/10
static-embedding-japanese を使った、単語埋め込みを推察して当てるゲーム。めちゃむずいおもしろ！
リンク
Packing with Flash Attention で効率的に LLM のファインチューニングをする | ho.lc
LLM のファインチューニングにおいて入力トークン列の packing と Attention 計算の効率化は、長いトークン列を扱う際に特に重要です。この記事では、長さの異なる入力トークン列を packing によってコンパクトにまとめて、追加の変換処理を行うことなく、Flash Attention のオリジナルの論文実装である flash-attn パッケージにある varlen (variable-length) interface である flash_attn_varlen_func 関数にシームレスにつなげて Attention 計算する実装を紹介します。本記事の要点は以下のとおりです： Padding vs Packing LLM の訓練時において、ミニバッチ内の入力トークン列の padding は、長さが異なる複数のトークン列をまとめて扱うための一般的な手法です。しかし、pa
secondlife 2025/02/10
リンク
vidore/colqwen2.5-v0.1 · Hugging Face
secondlife 2025/02/10
ColQwen2.5: Visual Retriever based on Qwen2.5-VL-3B-Instruct with ColBERT strategy
リンク
Vidore Leaderboard - a Hugging Face Space by vidore
Discover amazing ML apps made by the community
secondlife 2025/02/10
リンク
最強DB講義 #35 大規模言語モデルに基づく検索モデル
BERTやGPTなどの大規模言語モデルに基づく検索モデルは、様々なテストコレクションにおいて優れた性能を発揮し多くの注目を集めている。これまでに様々な検索モデルが提案されてきており、大規模言語モデルの情報検索への適用は、実サービスにおいても一般的なものとなっている。本講義では、大規模言語モデルを利用した…
secondlife 2025/02/10
DPR, ColBERT, SPLADE の流れがまとまっていてわかりやすい
リンク
1 2 3 4 5 6 7 8 9 10 次のページ