DEV Community
#llm
Posts
DiffusionGemma: How Google's New Open LLM Hits 1,000 Tokens/sec and Changes Inference Economics
Sayed Ali Alkamel, Jun 12, 4 min read, 5 reactions
#ai #machinelearning #llm #developers

5 Ways Prompt Injection Can Silently Compromise Your AI App
Nigel Rizzo, Jun 12, 4 min read
#ai #webdev #security #llm

LLM KV Cache Optimization, Open Model Evaluation, & Agent Engineering Skills for Local Deployment
soy, Jun 12, 3 min read
#ai #llm #selfhosted

I built a "boring" RAG demo over World Cup data — SQLite, sqlite-vec, and no framework
Parmod Gandhi, Jun 12, 4 min read
#rag #ai #sqlite #llm

Prompt cache, finally typed: shipping llm-ports 0.1.0-alpha.19
Babak Abbaschian, Jun 12, 6 min read
#ai #typescript #architecture #llm

Memory Poisoning: The Silent Threat to AI Agents (and How to Defend Against It)
Vaishnavi Gudur, Jun 12, 2 min read
#ai #security #python #llm

Why You Need to Become a Neuro-Punk Right Now
Artem X, Jun 12, 6 min read
#ai #llm #gpu

Token Cost Optimization: How to Cut LLM Inference Spend Without Cutting Quality
Nolan Vale, Jun 12, 5 min read, 1 reaction
#ai #llm #performance #rag

Model Context Protocol (MCP): The Protocol of the Future for AI
MaiTamDev, Jun 12, 12 min read
#modelcontextprotocol #mcp #giaothucngucanhmohinh #llm

Helicone is in maintenance mode. So I built the lightweight alternative I wanted.
Javokhir Khusanov, Jun 12, 2 min read
#showdev #ai #llm #monitoring

Introducing Leangetic: a local-first compiler for cheaper AI agents
DnaFIN, Jun 12, 3 min read
#showdev #agents #ai #llm

Scaling an LLM Scoring Pipeline From One Job to 10,000 a Day
Abdul Rehman, Jun 12, 5 min read
#llm #architecture #jobboard #production

LLM-powered extraction kept silently corrupting my database. Here's what I built to fix it. tags: node, llm, opensource, api
Joyal Seejo, Jun 12, 3 min read
#api #llm #nlp #node

Kong AI Gateway vs TrueFoundry: the honest version of this comparison
Sahajmeet Kaur, Jun 12, 7 min read
#ai #apigateway #mcp #llm

Every post my engine wrote hit 200 characters. Here is the fix.
Deva, Jun 12, 2 min read
#ai #automation #devjournal #llm