Tools & Tech

Tooling notes, implementation details, and small experiments.

LLMsinferencecost-optimizationGeminiOpenAIAWStool

Flex Inference: 50% Off LLM Calls on Gemini, OpenAI, and Bedrock

Every major AI provider now offers half-price inference if you can tolerate a few extra seconds of latency. One parameter change. Same API. Here's how it works and why.

testy.coolMay 12, 20264 min read
Flex Inference: 50% Off LLM Calls on Gemini, OpenAI, and Bedrock
agentscommerceprotocolsaistandards

UCP - How to Actually Make Money With It

Universal Commerce Protocol lets AI agents buy things. Here's how developers can monetize it and what store owners need to know.

testy.coolJanuary 23, 20264 min read
UCP - How to Actually Make Money With It