robertnishihara's submissions

1.		Major upgrades to Ray Serve: 88% lower latency and 11.1x higher throughput (anyscale.com)
		2 points by robertnishihara 44 days ago \| past \| 1 comment
2.		SkyRL brings Tinker to your GPUs (2025) (novasky-ai.notion.site)
		24 points by robertnishihara 84 days ago \| past \| 5 comments
3.		vLLM large scale serving: DeepSeek 2.2k tok/s/h200 with wide-ep (vllm.ai)
		147 points by robertnishihara 3 months ago \| past \| 54 comments
4.		Massively Parallel Agentic Simulations with Ray (anyscale.com)
		2 points by robertnishihara 8 months ago \| past
5.		Deploy DeepSeek‑R1 with VLLM and Ray Serve on Kubernetes (anyscale.com)
		1 point by robertnishihara 9 months ago \| past
6.		An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM (anyscale.com)
		1 point by robertnishihara 9 months ago \| past
7.		Native LLM APIs in Ray Data and Ray Serve (anyscale.com)
		2 points by robertnishihara 10 months ago \| past
8.		Joins and Hash-Shuffle in Ray Data (anyscale.com)
		3 points by robertnishihara 10 months ago \| past
9.		AsyncFlow: An Asynchronous Streaming RL Framework for LLM Post-Training (arxiv.org)
		4 points by robertnishihara 10 months ago \| past
10.		Open Source RL Libraries for LLMs (anyscale.com)
		1 point by robertnishihara 10 months ago \| past
11.		Large-Scale Deployment of Ray in Tencent's Weixin AI Infrastructure (anyscale.com)
		2 points by robertnishihara 10 months ago \| past
12.		Uv and Ray: Pain-Free Python Dependencies in Clusters (anyscale.com)
		44 points by robertnishihara 10 months ago \| past \| 10 comments
13.		Roll: Reinforcement Learning Optimization for Large-Scale Learning (github.com/alibaba)
		1 point by robertnishihara 10 months ago \| past
14.		An Open Source Stack for AI Compute: Kubernetes and Ray and PyTorch and VLLM (anyscale.com)
		1 point by robertnishihara 10 months ago \| past
15.		Uv and Ray: Pain-Free Python Dependencies in Clusters (anyscale.com)
		1 point by robertnishihara on Feb 28, 2025 \| past
16.		Ray Batch Inference at Pinterest (Part 3) (medium.com/pinterest-engineering)
		1 point by robertnishihara on Oct 16, 2024 \| past
17.		Direct Preference Optimization with Synthetic Data on Anyscale (anyscale.com)
		1 point by robertnishihara on Aug 21, 2024 \| past
18.		Building an LLM Router for High-Quality and Cost-Effective Responses (anyscale.com)
		1 point by robertnishihara on July 2, 2024 \| past
19.		Ray Infrastructure at Pinterest (medium.com/pinterest-engineering)
		1 point by robertnishihara on June 18, 2024 \| past
20.		Lessons from training a Stable Diffusion model on 2B images (anyscale.com)
		5 points by robertnishihara on May 11, 2024 \| past
21.		Canva Built a Modern AI Platform Using Anyscale (anyscale.com)
		2 points by robertnishihara on April 3, 2024 \| past
22.		Building RAG-Based LLM Applications for Production (anyscale.com)
		2 points by robertnishihara on Feb 14, 2024 \| past
23.		Fine-tuning LLMs for longer context and better RAG systems (anyscale.com)
		1 point by robertnishihara on Feb 13, 2024 \| past
24.		Two-day hands-on RAG Bootcamp for developers (twitter.com/martin_casado)
		2 points by robertnishihara on Jan 31, 2024 \| past
25.		RAG at Scale: 10x Cheaper Embedding Computations with Anyscale and Pinecone (anyscale.com)
		1 point by robertnishihara on Jan 16, 2024 \| past
26.		Comparing LLM Performance: Introducing the Open Source Leaderboard for LLM APIs (anyscale.com)
		2 points by robertnishihara on Dec 21, 2023 \| past
27.		LLMPerf Leaderboard (github.com/ray-project)
		5 points by robertnishihara on Dec 21, 2023 \| past
28.		Anyscale Endpoints: JSON Mode and Function Calling Features (anyscale.com)
		2 points by robertnishihara on Dec 14, 2023 \| past
29.		LLM summarization: A case study of human, Llama-2, & GPT-4 summarization quality (anyscale.com)
		1 point by robertnishihara on Nov 10, 2023 \| past
30.		Reproducible Performance Metrics for LLM Inference (anyscale.com)
		2 points by robertnishihara on Nov 2, 2023 \| past