<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[The Equation]]></title><description><![CDATA[Achieving balance for optimal performance]]></description><link>https://www.equationblog.com</link><image><url>https://substackcdn.com/image/fetch/$s_!-MQk!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png</url><title>The Equation</title><link>https://www.equationblog.com</link></image><generator>Substack</generator><lastBuildDate>Mon, 06 Apr 2026 19:57:43 GMT</lastBuildDate><atom:link href="https://www.equationblog.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Ruslan Belkin]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[ruslan@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[ruslan@substack.com]]></itunes:email><itunes:name><![CDATA[Ruslan Belkin]]></itunes:name></itunes:owner><itunes:author><![CDATA[Ruslan Belkin]]></itunes:author><googleplay:owner><![CDATA[ruslan@substack.com]]></googleplay:owner><googleplay:email><![CDATA[ruslan@substack.com]]></googleplay:email><googleplay:author><![CDATA[Ruslan Belkin]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Reliability knobs for agents]]></title><description><![CDATA[Spring 2026 update]]></description><link>https://www.equationblog.com/p/reliability-knobs-for-agents</link><guid isPermaLink="false">https://www.equationblog.com/p/reliability-knobs-for-agents</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Mon, 30 Mar 2026 14:15:11 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-MQk!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The agent conversation is as noisy as it has ever been:</p><ul><li><p>One camp says base models are now good enough, just give them tools.</p></li><li><p>Another (the moi is mostly in. it) says evaluation is the bottleneck, just build better judges.</p></li></ul><p>While I would assign greater weight to the second camp, they are both directionally right and still too coarse.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The durable gains are showing up somewhere less glamorous and much more useful: new reliability knobs. Not one magical agent architecture, but a stack of control surfaces that help systems lose less intent, preserve more capability, remember more of the right state, and fail in ways teams can actually inspect and correct.</p><p>Several papers of note along these problems:</p><h2><strong>1. PPS: make intent explicit before the model starts guessing</strong></h2><p>Natural-language prompts have a hidden failure mode: intent transmission loss. The user knows what they mean; the model only sees the compressed, underspecified surface form. The PPS paper attacks that directly. Across 60 tasks, 3 domains, 3 models, and 540 generations, natural-language-rendered PPS outperformed both simple prompts and raw JSON on goal alignment. The gains were strongest in ambiguous business tasks, weaker in technical tasks, and actually reversed in low-ambiguity travel planning. That last part matters. It means structured prompting is not magic; it is most valuable when the user&#8217;s objective is fuzzy enough to be misread. The preliminary survey result is also practical: fewer follow-up rounds, from 3.33 to 1.13 on average.</p><p><strong>Why we should care</strong>: a lot of agent failure is downstream of an upstream ambiguity. If the goal, audience, constraints, tone, and success criteria are hazy at turn one, the rest of the trajectory is just confident error propagation. The other useful lesson here is that raw structure is not enough. In the study, rendered PPS beat raw JSON. So the knob is not &#8220;add schema everywhere.&#8221; It is &#8220;make intent explicit in a form the model can actually use.&#8221;</p><p>Paper: <a href="https://arxiv.org/abs/2603.18976">https://arxiv.org/abs/2603.18976</a></p><h2><strong>2. Prompt repetition: a surprisingly cheap robustness hack</strong></h2><p>This one sounds silly until you think about what causal attention is doing.</p><p>The core idea is simple: when reasoning is not enabled, repeat the prompt. The paper shows that prompt repetition improved accuracy across popular models without increasing output length or materially increasing latency in most settings. On their experiments, it won 47 of 70 benchmark-model combinations with 0 losses when reasoning was disabled. The gains were especially strong when the prompt order was hostile to the model, such as options-first multiple choice, and on custom tasks like NameIndex and MiddleMatch.</p><p><strong>Why we should care</strong>: not every model call inside an agent should be a long reasoning trace. Some calls are cheap subroutines: route this, extract that, normalize this, draft tool arguments, re-check the user constraint. For those non-reasoning calls, prompt repetition looks like a genuinely useful default knob. It is not a substitute for reasoning, and it is not free on arbitrarily long prompts, but for short operational subcalls it is exactly the kind of low-cost robustness trick teams tend to underrate.</p><p>Paper: <a href="https://ar5iv.labs.arxiv.org/html/2512.14982v1">https://ar5iv.labs.arxiv.org/html/2512.14982v1</a></p><h2><strong>3. GLM-5: do not buy agentic RL by deleting earlier skills</strong></h2><p>Sequential post-training has a nasty habit: each new stage can quietly sand down the thing the previous stage got good at.</p><p>GLM-5 is interesting partly because it says that out loud. Their pipeline runs sequential RL stages for reasoning, then agentic behavior, then general helpfulness, and then uses on-policy cross-stage distillation as a final refinement to recover skills from earlier stages. Previous stage checkpoints become teachers; the final pass is meant to stop the classic &#8220;more agentic, less sharp&#8221; tradeoff from becoming acceptable collateral damage. On their reported benchmarks, GLM-5 posts about a 20% average gain over GLM-4.7 across agentic, reasoning, and coding tasks, including 77.8 on SWE-bench Verified.</p><p>The more durable lesson is even lower-level than that. The paper is refreshingly explicit that agentic RL stability lives in systems details as much as in objectives. They switched to a deterministic top-k operator because nondeterministic sparse-attention selection caused sharp RL degradation, froze the indexer during RL for stability, and emphasized token-in-token-out handling so the trainer learns on exactly the same token stream produced by the rollout engine. That is the kind of detail that separates &#8220;agent demo&#8221; from &#8220;agent training system.&#8221;</p><p><strong>Why we should care</strong>: a lot of agent improvement work still acts as if new capabilities can simply be stacked. In practice, they interfere. If post-training adds planning but dulls reasoning, or adds autonomy but destabilizes the learning loop, you have not really improved the agent. You just moved the failure somewhere harder to notice.</p><p>Paper: <a href="https://ar5iv.labs.arxiv.org/html/2602.15763v2">https://ar5iv.labs.arxiv.org/html/2602.15763v2</a></p><h2><strong>4. FullStack-Agent: round-trip the artifact, then test the hidden surfaces</strong></h2><p>I like this one because it attacks a very real agent failure mode: the frontend looks right, the demo works, and the backend is still fake.</p><p>FullStack-Agent combines three ideas. First, a multi-agent development workflow with specialized debugging tools for frontend and backend work. Second, FullStack-Bench, which evaluates not just frontend behavior but backend APIs and database state as well. Third, Repository Back-Translation, which converts existing real-world repositories into agent trajectories the model can learn from. The benchmark itself is notable: 101 instructions, 647 frontend tests, 604 backend tests, and 389 database tests. Even better, frontend success is not counted unless the required database interaction is real. That is exactly the kind of hidden-surface check agent evaluation needs more of.</p><p>The results are strong, but the more interesting pattern is the training and evaluation shape. FullStack-Dev with a Qwen backbone reached 64.7 frontend, 77.8 backend, and 77.9 database accuracy, while FullStack-Learn improved a 30B model through self-improvement using repository back-translation and augmentation. The debugging tools also mattered a lot: removing the backend debugging tool increased average backend iterations from 74.9 to 115.5. That is not just a model story. It is a workflow design story.</p><p><strong>Why we should care</strong>: reliable coding agents need falsifiable artifacts. A useful practical extension of this idea is round-tripping: code to spec, spec back to code, compare the two, and inspect the mismatch. That creates a verifier surface instead of treating the codebase as one opaque blob. More broadly, the paper is a reminder that real artifacts and real tests are better teachers than synthetic vibes.</p><p>Paper: <a href="https://arxiv.org/html/2602.03798v1">https://arxiv.org/html/2602.03798v1</a></p><h2><strong>5. MSA: memory should be part of the model, not a retrieval afterthought</strong></h2><p>Agents with long histories do not just need more context. They need memory that stays usable when the history becomes absurd.</p><p>MSA pushes that idea hard. The paper proposes an end-to-end trainable memory framework with sparse attention, document-wise RoPE, KV compression, and a Memory Parallel inference path. The headline is the kind of number people usually ignore until it becomes operationally relevant: less than 9% degradation while scaling from 16K to 100M tokens, with 100M-token inference on 2xA800 GPUs. The other important piece is Memory Interleave, which alternates retrieval, context expansion, and generation so the model can reason across scattered memory segments instead of just pulling one flat chunk and hoping.</p><p><strong>Why we should care</strong>: a lot of current agent memory stacks are really retrieval pipelines wearing a memory costume. That works until the task needs long-range consistency, multi-hop evidence integration, or stable persona/state over time. MSA is interesting because it tries to make memory intrinsic and differentiable rather than bolted on. The real caveat is operational: the current setup still relies on offline pre-encoding of the corpus. So it is not a universal replacement for dynamic knowledge systems yet. But as a direction, it is much closer to agent memory than &#8220;just add bigger RAG.&#8221;</p><p>Paper: <a href="https://arxiv.org/abs/2603.23516">https://arxiv.org/abs/2603.23516</a></p><h2><strong>6. Verifier&#8211;compiler loops: verification is becoming its own stack</strong></h2><p>This is the one I keep coming back to.</p><p>The core production fact is ugly and simple: long workflows multiply small defects. In the verifier&#8211;compiler loop framing, a 1% failure rate across 100 steps leaves only about 36.6% end-to-end success. Even 0.1% per-step failure still leaves only about 90.5%. That is the march-of-nines problem. The implication is that agent reliability is not mainly a prompt problem. It is an error-correction problem. The system needs to observe the episode, judge it against institutional standards, intervene conservatively, replay changes before release, and keep durable evidence of what changed and why. That is also why the distinction between execution knowledge and institutional judgment matters: the agent can know the facts and still fail the organization.</p><p>Recent judge work mostly points in the same direction. JudgeBench shows hard evaluator tasks are genuinely hard, with strong models like GPT-4o only slightly above random on some challenging judge settings. RewardBench 2 makes reward evaluation meaningfully harder than RewardBench 1 and emphasizes correlation with downstream use. DeepSeek&#8217;s GRM/SPCT line is also important because it argues that reward modeling itself can scale with more inference compute through principle generation, critique, and voting, not just with bigger training runs.</p><p>But the field is also getting more honest about calibration. Evaluative Fingerprints found near-zero inter-judge agreement while also showing that judges are individually stable enough to be fingerprinted from their rubric behavior. In other words: they are not random, they are systematically different. Separate work on LLM-as-a-judge reporting shows that evaluator bias and uncertainty should be corrected statistically, not hand-waved. On user simulation, the news is similarly mixed: SimulatorArena suggests profile-conditioned simulators can track human judgments reasonably well on some tasks, but Lost in Simulation shows simulator choice can move measured success rates by up to 9 points and systematically miscalibrate difficulty.</p><p><strong>Why we should care</strong>: one judge score is not a control system. High-reliability agents are going to need judge stacks, not judge monocultures: crisp gates for obvious defects, stronger reasoning judges for nuance, replay before release, disagreement review for hard cases, and humans on the highest-risk boundaries. Simulation will help widen coverage, but only if it is continuously calibrated against real traces.</p><p>Blog: <a href="http://www.equationblog.com/p/the-verifiercompiler-loop-turning">https://www.equationblog.com/p/the-verifiercompiler-loop-turning</a></p><h2><strong>7. IndexCache: systems work is reliability work too</strong></h2><p>This one is more infrastructure than alignment, but it belongs in the same conversation.</p><p>IndexCache starts from a simple observation: in sparse attention, adjacent layers often choose very similar top-k token sets. So instead of recomputing the indexer at every layer, reuse it across layers. On the reported results, that removes up to 75% of indexer computation with negligible quality loss, while reaching 1.82x prefill speedup and 1.48x decode speedup at 200K context. The paper also reports 70&#8211;100% top-k overlap across adjacent layers, which is the structural reason the trick works.</p><p><strong>Why we should care</strong>: efficiency is not separate from reliability. Every unit of inference cost you remove from the serving path can be reinvested into something reliability-shaped: longer context, more retrieval, more search, more verifier passes, more replay budget, or simply lower latency at the same control quality. That is why inference-side engineering keeps mattering more than people think.</p><p>Paper: <a href="https://arxiv.org/html/2603.12201v1">https://arxiv.org/html/2603.12201v1</a></p><h2><strong>The connective tissue</strong></h2><p>If I had to compress the direction into one line, it is this: <em><strong>reliable agents are becoming layered control systems</strong>.</em></p><p>Structured intent reduces loss before the trajectory begins. Prompt repetition stabilizes cheap non-reasoning subcalls. Post-training methods like cross-stage distillation try to make new capabilities additive instead of destructive. Artifact-grounded training and hidden-surface testing make agent outputs more falsifiable. Long-memory work tries to decouple memory capacity from reasoning quality. Judge research is forcing evaluation to become calibrated, replayable, and auditable. Systems work buys the budget to do more of all of it in real time.</p><p>Just a growing stack of knobs that make agent behavior narrower, more inspectable, and a little less mysterious week over week.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[The Verifier–Compiler Loop: Turning Human Preferences into Production Agent Judgment]]></title><description><![CDATA[Why verifier&#8211;compiler loops matter more than another prompt patch]]></description><link>https://www.equationblog.com/p/the-verifiercompiler-loop-turning</link><guid isPermaLink="false">https://www.equationblog.com/p/the-verifiercompiler-loop-turning</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Mon, 09 Mar 2026 14:14:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!31Us!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Agents do not usually fail in production because a prompt suddenly stopped working. They fail because a workflow that looked 98% fine in isolation turns into 30 turns, six tool calls, two handoffs, a compliance boundary, and a frustrated human on the other side.</p><p>That is the march-of-nines problem (as originally described by Andrey Karpathy). In a long workflow, tiny per-step defects compound into very large end-to-end losses. At 100 steps, a 1% failure rate at each step leaves only about 36.6% end-to-end success; even 0.1% still leaves only about 90.5%. For customer service, regulated operations, and multi-agent workflows, that gap is the difference between a promising demo and a system the business can trust.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The practical implication is that production reliability should be treated as an error-correction problem. Better base models help, but they are only part of the story. The system also needs a way to observe what happened, judge it against the organization&#8217;s standards, intervene safely when necessary, replay changes before release, and keep durable evidence of what changed and why.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!31Us!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!31Us!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png 424w, https://substackcdn.com/image/fetch/$s_!31Us!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png 848w, https://substackcdn.com/image/fetch/$s_!31Us!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png 1272w, https://substackcdn.com/image/fetch/$s_!31Us!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!31Us!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png" width="1456" height="765" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:765,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!31Us!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png 424w, https://substackcdn.com/image/fetch/$s_!31Us!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png 848w, https://substackcdn.com/image/fetch/$s_!31Us!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png 1272w, https://substackcdn.com/image/fetch/$s_!31Us!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F61ef6284-6a40-4450-b44b-f2e768be2860_1561x820.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: center;"><em>Figure 1. In long workflows, small per-step errors compound into large end-to-end losses.</em></p><p style="text-align: center;">Reliability should come from continuously catching, correcting, compiling, and proving small errors &#8212; not from hoping one prompt patch will cover every edge case.</p><p><strong>Production failures are not only knowledge failures</strong></p><p>A surprisingly large share of production failures are not missing-fact failures. The agent may know the product terms, retrieve the right document, or call the right tool and still fail the institution.</p><p>Four surfaces matter in practice: missing knowledge, institutional judgment, user affect, and evidence. A response can be factually correct but still violate policy, choose the wrong level of certainty, worsen the user&#8217;s emotional trajectory, or leave the team unable to reconstruct what happened well enough to approve a safe fix.</p><ul><li><p><strong>Missing knowledge: </strong>The agent lacks a fact, a tool result, or an updated policy exception.</p></li><li><p><strong>Judgment misalignment: </strong>The facts are present, but the trade-off between speed, certainty, empathy, policy, or escalation is wrong.</p></li><li><p><strong>Affect regression: </strong>The reply is technically valid but increases frustration, distrust, or confusion.</p></li><li><p><strong>Evidence gap: </strong>The team cannot replay the episode, inspect the rewrite, or approve the next release with confidence.</p><p></p></li></ul><p>A banking example makes the distinction concrete. An assistant can know the product, know the account state, and still reply in a way that is too dismissive, too certain, or too slow to escalate.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!3Gry!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3Gry!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png 424w, https://substackcdn.com/image/fetch/$s_!3Gry!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png 848w, https://substackcdn.com/image/fetch/$s_!3Gry!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png 1272w, https://substackcdn.com/image/fetch/$s_!3Gry!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3Gry!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png" width="944" height="496" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:496,&quot;width&quot;:944,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:122465,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.equationblog.com/i/190227900?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3Gry!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png 424w, https://substackcdn.com/image/fetch/$s_!3Gry!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png 848w, https://substackcdn.com/image/fetch/$s_!3Gry!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png 1272w, https://substackcdn.com/image/fetch/$s_!3Gry!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03d3aa13-f0e9-4d50-8e18-bafa20b04819_944x496.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: center;"><em>Figure 2. A customer-facing answer can know the facts and still miss the institution&#8217;s judgment on policy, style, affect, or action.</em></p><p><strong>Judgment should be explicit</strong></p><p>One way to make this tractable is to keep execution knowledge and institutional judgment separate, but versioned together. Skills describe how the work gets done: task-specific success criteria, approved procedures, tool permissions, and required evidence. Judgment describes how the organization wants that work done: risk boundaries, policy rules, quality bars, tone, trade-offs, escalation behavior, and release gates.</p><p>That separation matters because business policy changes faster than workflow logic. If a new escalation rule or compliance boundary requires rewriting every skill from scratch, the system becomes brittle. If judgment is explicit, global changes can be compiled once and enforced across many workflows.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tGvT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tGvT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png 424w, https://substackcdn.com/image/fetch/$s_!tGvT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png 848w, https://substackcdn.com/image/fetch/$s_!tGvT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png 1272w, https://substackcdn.com/image/fetch/$s_!tGvT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tGvT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png" width="1456" height="765" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:765,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tGvT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png 424w, https://substackcdn.com/image/fetch/$s_!tGvT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png 848w, https://substackcdn.com/image/fetch/$s_!tGvT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png 1272w, https://substackcdn.com/image/fetch/$s_!tGvT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F13cb7838-06b9-42c8-bfaf-46961435e6b1_1561x820.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: center;"><em>Figure 3. A reliable agent should keep workflow execution and institutional judgment separate, but versioned together as a behavioral contract.</em></p><p><strong>An example of a production flywheel</strong></p><p>A useful production flywheel is not a vague analytics dashboard. It turns one live interaction into an ordered, auditable episode. In one practical pattern, a conversation becomes: user message &#8594; draft response &#8594; affect and intent signals &#8594; initial evaluation &#8594; rewrite decision &#8594; post-rewrite evaluation &#8594; delivered message.</p><p>Once that structure exists, the same episode can be replayed later, reviewed by humans, approved or rejected, and compiled into the next version of the behavioral contract. Runtime interventions can stay conservative &#8212; rewrite, route, pause, slow-halt, or escalate &#8212; while the offline loop decides what should become a durable product change.</p><p>This is the shift from prompt folklore to production engineering. A corrected response is useful at the moment; a replayable, reviewable correction is useful week over week.</p><p><strong>A few operating patterns tend to work</strong></p><ul><li><p>Observe real traces, not only synthetic eval sets.</p></li><li><p>Keep affect, policy, and tool behavior in the same episode view.</p></li><li><p>Replay proposed changes before release rather than patching blindly in production.</p></li><li><p>Treat approvals and evidence as part of the product, not as after-the-fact documentation.</p></li></ul><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vYkO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vYkO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png 424w, https://substackcdn.com/image/fetch/$s_!vYkO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png 848w, https://substackcdn.com/image/fetch/$s_!vYkO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png 1272w, https://substackcdn.com/image/fetch/$s_!vYkO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vYkO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png" width="1456" height="765" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:765,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vYkO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png 424w, https://substackcdn.com/image/fetch/$s_!vYkO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png 848w, https://substackcdn.com/image/fetch/$s_!vYkO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png 1272w, https://substackcdn.com/image/fetch/$s_!vYkO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F46e25816-8b6c-412a-b200-ba5ac1d95aae_1561x820.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: center;"><em>Figure 4. A production flywheel turns one conversation into an ordered, replayable episode that can be inspected, approved, and released with confidence.</em></p><p><strong>Judge quality now depends on calibration</strong></p><p>Recent judge research points in two directions at once. On one hand, judges are improving. Benchmarks such as JudgeBench and RewardBench 2 are making judge quality easier to measure, while DeepSeek&#8217;s GRM / SPCT work suggests that principle generation, critique, and inference-time aggregation can make reward modeling and preference judging much stronger in practice.</p><p>On the other side, the field is getting more honest about calibration. Newer 2025&#8211;2026 work argues that one raw judge score is not enough. Some evaluators show stable but different &#8220;evaluative fingerprints,&#8221; meaning they can be internally consistent while still disagreeing systematically with one another. Other papers show that rubric order, pointwise versus pairwise framing, and judge allocation can shift rankings if those choices are left uncalibrated.</p><p>The production lesson is simple: one judge should not be the whole control system. High-reliability systems tend to use a judge stack &#8212; crisp gates for clear defects, calibrated reasoning judges for nuance, replay for release decisions, disagreement review for hard cases, and humans for the highest-risk boundaries.</p><p style="text-align: center;">The newest question is no longer only &#8220;Which judge scores highest on a benchmark?&#8221; It is also &#8220;Which judge remains stable, interpretable, and properly calibrated inside a release process?&#8221;</p><p><strong>User simulation can help, but only with calibration</strong></p><p>User simulation is becoming necessary because replay without synthetic users does not cover enough edge cases. Recent work such as SimulatorArena suggests that profile-conditioned simulators can track human ratings reasonably well on some multi-turn tasks, especially when the simulator has access to richer user profiles rather than a generic system prompt.</p><p>But simulation should not be treated as ground truth. Lost in Simulation is the warning label: simulator choice can materially change measured success, and simulated populations can drift away from real human behavior. The practical pattern is simulation plus calibration &#8212; use synthetic users to widen test coverage, then measure the gap to hold-out human traces and correct for it.</p><p><strong>Auditability should be part of the product</strong></p><p>If the team cannot reconstruct which draft was blocked, which rewrite was sent, what evidence triggered the intervention, and which version of the behavioral contract approved the change, then it does not really have a production flywheel. It has prompt folklore.</p><p>Auditability is what turns one corrected episode into a durable release decision. In practice, that usually means keeping trace and event correlation, judge verdicts, rewrites, replay results, approval records, and release decisions together. The goal is not paperwork. The goal is to make the next change easier to inspect, safer to ship, and easier to trust.</p><p><strong>How this connects to my ODSC East talk</strong></p><p>At ODSC East 2026, I&#8217;ll go deeper into the mechanics behind this pattern: how judge stacks can be calibrated, how runtime interventions can be chosen without creating new risk, how replay should precede release, and how week-over-week improvements can turn human preferences into durable production agent judgment.</p><p>The talk title is &#8220;<strong>The Verifier&#8211;Compiler Loop: Turning Human Preferences into Production Agent Judgment.</strong>&#8221; This article only sketches the frame. The session will go much deeper into the system design and operating model behind it.</p><p><strong>Selected references</strong></p><ul><li><p><a href="https://arxiv.org/abs/2410.12784">JudgeBench</a></p></li><li><p><a href="https://arxiv.org/abs/2506.01937">RewardBench 2</a></p></li><li><p><a href="https://arxiv.org/abs/2504.02495">Inference-Time Scaling for Generalist Reward Modeling (DeepSeek GRM / SPCT)</a></p></li><li><p><a href="https://arxiv.org/abs/2511.21140">How to Correctly Report LLM-as-a-Judge Evaluations</a></p></li><li><p><a href="https://arxiv.org/abs/2601.05114">Evaluative Fingerprints: Stable and Systematic Differences in LLM Evaluator Behavior</a></p></li><li><p><a href="https://arxiv.org/abs/2602.16610">Who can we trust? LLM-as-a-jury for Comparative Assessment</a></p></li><li><p><a href="https://arxiv.org/abs/2510.05444">SimulatorArena</a></p></li><li><p><a href="https://arxiv.org/abs/2601.17087">Lost in Simulation</a></p></li></ul><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Agent reliability]]></title><description><![CDATA[A narrow look at what the last ~90 days of research is teaching us]]></description><link>https://www.equationblog.com/p/agent-reliability</link><guid isPermaLink="false">https://www.equationblog.com/p/agent-reliability</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Mon, 26 Jan 2026 15:15:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-MQk!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The field is moving extremely fast right now. New agent stacks, new evals, new post-training tricks - the whole ecosystem shifts weekly.</p><p>But if you ship agents, you learn a painful lesson fast:</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><strong>An agent that succeeds once is not a reliable agent.</strong></p><p>Single-run success rates are demo metrics. Production reliability is a different game:</p><ol><li><p>Consistency across runs (the same task, same setup, multiple attempts)</p></li><li><p>Robustness to &#8220;equivalent&#8221; user inputs (paraphrases, small spec changes, harmless reorderings)</p></li><li><p>Grace under tool/API failures (because they will fail - timeouts, rate limits, partial responses, schema drift)</p></li><li><p>If I had to compress the theme of the last ~90 days into one line, it&#8217;s this:</p></li></ol><p><strong>Reliability is a surface, not a score.</strong></p><p>This post is intentionally narrow: recent work that treats agent reliability as a first-class object - not a vibe.</p><h1>The production reality check: humans are still the reliability layer</h1><p>A paper I keep pointing people to is &#8220;Measuring Agents in Production.&#8221; It&#8217;s one of the rare efforts that asks practitioners what&#8217;s actually working (and what&#8217;s breaking).</p><p>A few findings that stuck with me:</p><ul><li><p>Many production agents are built to be simple and controllable: 68% run at most 10 steps before requiring human intervention.</p></li><li><p>Most teams lean on prompting off-the-shelf models vs weight tuning (70%), and rely primarily on human evaluation (74%).</p></li><li><p>Reliability shows up as the top challenge - especially &#8220;ensuring and evaluating correctness.&#8221;</p></li></ul><p><strong>That&#8217;s the current equilibrium: humans as circuit breakers.</strong></p><p>The real question is how we scale beyond that without lying to ourselves about what &#8220;reliable&#8221; means.</p><h2>ReliabilityBench: measuring reliability as a surface, not a score</h2><p><a href="https://arxiv.org/abs/2601.06112">ReliabilityBench</a> is exactly the kind of benchmark we&#8217;ve needed.</p><p>Instead of asking &#8220;did it succeed,&#8221; it asks:</p><ol><li><p>Does it succeed again (consistency)</p></li><li><p>Does it succeed under equivalent variations of the task (robustness)</p></li><li><p>Does it survive tool/API failures (fault tolerance)</p><p></p></li></ol><p>They formalize this across three dimensions:</p><ul><li><p>pass^k for repeated execution</p></li><li><p>perturbation intensity &#949;</p></li><li><p>fault intensity &#955;</p></li><li><p>...and propose a unified reliability surface: R(k, &#949;, &#955;).</p></li></ul><p>Two ideas here that I think will stick:</p><ol><li><p>Action metamorphic relations: judge correctness by end-state equivalence rather than brittle text matching.</p></li><li><p>Chaos-style fault injection: simulate timeouts, rate limits, partial responses, schema drift.</p></li></ol><p>The reported results are the point:</p><ol><li><p>Perturbations alone reduced success from 96.9% at &#949;=0 to 88.1% at &#949;=0.2.</p></li><li><p>Rate limiting was especially damaging.</p></li></ol><p>This is what &#8220;production-like&#8221; really means: not one clean run, but performance under stress.</p><p><strong>Why we should care:</strong></p><ol><li><p>If you only track single-run pass rates, you end up optimizing for demos.</p></li><li><p>A reliability surface forces the conversation into repeatability, robustness, and failure modes.</p></li></ol><h2>E-valuator: turn &#8220;judge scores&#8221; into runtime decisions (with guarantees)</h2><p>Assume you&#8217;ve built a verifier (LLM judge, PRM, heuristics). You can score trajectories - but can you trust the score enough to make a runtime decision?</p><p><a href="https://arxiv.org/abs/2512.04123">E-valuator</a> reframes this as a sequential hypothesis testing problem: distinguish successful vs unsuccessful trajectories as actions unfold, using a statistically valid test at every step.</p><p>They propose converting any black-box verifier score into a decision rule with controlled false-alarm rates, and show it can both improve monitoring and terminate problematic trajectories early to save tokens.</p><p><strong>Why we should care:</strong></p><ol><li><p>&#8220;Judge reliability&#8221; is now a core dependency for agent reliability.</p></li><li><p>This is one path from heuristics to operational control.</p></li></ol><h2>LLMdoctor: test-time steering as a reliability tool</h2><p>Benchmarks and verifiers tell you &#8220;it broke.&#8221;</p><p>But reliability also requires you &#8220;fix it now.&#8221;</p><p>That&#8217;s why I like test-time alignment approaches that are modular. <a href="https://arxiv.org/abs/2601.10416">LLMdoctor</a> has a clean patient-doctor framing: steer a frozen model with a smaller controller trained on token-level preference signals, via token-level flow-guided preference optimization.</p><p>Even if you ignore the specific algorithm, the pattern matters:</p><ol><li><p>You can steer without retraining the foundation.</p></li><li><p>You can make reliability interventions fast and reversible.</p></li><li><p>You can version and evaluate the controller like a product.</p></li></ol><p><strong>Why we should care:</strong></p><ul><li><p>Most teams treat reliability fixes as either &#8220;change the prompt&#8221; or &#8220;fine-tune and pray.&#8221;</p></li><li><p>Controller-style steering gives a third option: a scoped, testable intervention layer.</p></li></ul><h2>Human-in-the-loop rubrics: reliability is often a &#8220;shared standard&#8221; problem</h2><p>The hardest part of agent reliability isn&#8217;t always the model.</p><p>Sometimes it&#8217;s the absence of a shared, auditable definition of &#8220;correct.&#8221;</p><p>A recent <a href="https://arxiv.org/abs/2511.10865">paper</a> on patch evaluation proposes a simple but scalable framework:</p><ul><li><p>use an LLM to draft a task-specific rubric,</p></li><li><p>have a human review/refine it once,</p></li><li><p>use the rubric-guided LLM judge to evaluate many candidates.</p></li></ul><p>They report improved agreement with human consensus (e.g., Cohen&#8217;s kappa 0.75 on the subset with unanimous human agreement), plus high recall/precision in that setting.</p><p>Even though the domain is program repair, the reliability lesson generalizes:</p><p>When humans disagree, it&#8217;s often because the rubric is implicit. Make it explicit once - then scale it.</p><div><hr></div><h1>A narrow reliability loop I&#8217;d actually run</h1><p>If I had to condense the above into a practical loop (without turning it into a platform pitch), it would look like this:</p><ol><li><p><strong>Define correctness in end-states, not text: </strong>Use metamorphic relations / end-state equivalence where possible.</p></li><li><p><strong>Stress-test, don&#8217;t just benchmark: </strong>Measure a reliability surface across repeated runs (k), perturbations (&#949;), and tool failures (&#955;).</p></li><li><p><strong>Monitor online with calibrated decision rules: </strong>Turn verifier scores into stop / continue / escalate decisions you can defend.</p></li><li><p><strong>Keep humans as reviewers of standards, not full-time graders: </strong>Use human time to approve/refine rubrics and resolve disagreements.</p></li><li><p><strong>Treat steering as a first-class intervention: </strong>Controller models (doctor -&gt; patient) are a pragmatic way to improve behavior without turning every fix into a full retrain.</p></li></ol><div><hr></div><p>Agent reliability is not a single feature and not a single metric. It&#8217;s a contract:</p><ol><li><p>measured under stress</p></li><li><p>monitored online</p></li><li><p>improved with small, controlled interventions</p></li><li><p>audited through shared standards humans can actually read.</p></li></ol><p>The best recent work is finally treating reliability like an object we can engineer - not a hope we can prompt.</p><p><strong>Quick credit</strong>: <strong><a href="https://www.linkedin.com/in/aw-ai/">Andy Wong</a></strong> consistently finds great new papers early, and we end up debating the implications together before they show up in writing.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Control knobs from recent LLM papers]]></title><description><![CDATA[January 2026 Update]]></description><link>https://www.equationblog.com/p/control-knobs-from-recent-llm-papers</link><guid isPermaLink="false">https://www.equationblog.com/p/control-knobs-from-recent-llm-papers</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Tue, 20 Jan 2026 15:15:42 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-MQk!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The field is moving extremely fast right now. The half-life of a new idea is measured in weeks (sometimes days), and it is getting harder to tell what will actually stick.</p><p>Most weeks, the discourse around language models collapses into one of two modes:</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><ul><li><p>&#8220;Look at this new leaderboard bump.&#8221;</p></li><li><p>&#8220;Agents are coming, everything changes.&#8221;</p></li></ul><p>Both can be true - and still miss the point.</p><p>The stuff that actually moves production outcomes tends to look like new control knobs: better ways to steer models, keep them stable, make them faster, and make failures more legible.</p><p>Credit where it is due: a bunch of these paper finds came from <a href="https://www.linkedin.com/in/aw-ai/">Andy Wong</a>, who consistently surfaces great work. We usually end up debating the implications together before it shows up here.</p><p>So here is my January reading stack: papers that feel unusually primitive-shaped. Each one adds a knob I expect we will keep using.</p><h2>1. <a href="https://www.linkedin.com/posts/rbelkin_251224601-activity-7419125318424145920-Xyaz">Recursive Language Models</a>: &#8220;infinite&#8221; context via self-calls (no retraining)</h2><p>Recursive Language Models (RLMs) are a different answer to long context: do not cram the prompt into the transformer - treat it like part of the environment.</p><p>One concrete instantiation: the prompt becomes a variable inside a Python REPL, and the model writes code to inspect the prompt, decompose it, and recursively call sub-instances of itself over slices of the prompt.</p><p>They report handling inputs two orders of magnitude beyond typical context windows - and even mention strong performance at the 10M+ token scale.</p><p><strong>Why we should care:</strong></p><ul><li><p>This pushes long context from architecture into systems. Not &#8216;train longer,&#8217; but &#8216;reason out-of-core.&#8217;</p></li><li><p>It is an agent-shaped pattern: when the model can write the loop it thinks inside, you get a new class of tool-use + decomposition behaviors.</p></li></ul><p>It reframes the bottleneck: the limit becomes less context length and more how good the model is at building the right indexing + recursion strategy.</p><h2>2. <a href="https://www.linkedin.com/posts/rbelkin_powerpoint-%E6%BC%94%E7%A4%BA%E6%96%87%E7%A8%BF-activity-7418418556901470208-iaSR?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAAPDpcBcLu8_J9WkZjSGCZolugTdGmprHo">LLMdoctor</a>: alignment at test-time, token by token</h2><p>Most alignment work still assumes you either fine-tune the whole model (expensive, slow, often brittle), or you do test-time tricks that are coarse (trajectory-level) and compute-hungry.</p><p>LLMdoctor proposes a clean separation: keep a big &#8216;patient&#8217; model frozen, and steer it with a smaller &#8216;doctor&#8217; model using token-level signals.</p><p>Their claim is that many test-time alignment methods rely on distorted trajectory-level rewards or inefficient sampling that caps performance and harms diversity. The patient-doctor setup extracts token-level preference signals from the patient&#8217;s behavioral variations, then trains the doctor via token-level flow-guided preference optimization (TFPO) to preserve diversity while aligning outputs.</p><p><strong>Why we should care:</strong></p><ul><li><p>Steering becomes modular. Iterate on the doctor without re-baking the patient.</p></li><li><p>Granularity matters. Token-level intervention is the difference between &#8216;mostly aligned&#8217; and &#8216;aligned where it counts.&#8217;</p></li></ul><p>Closer to how agents fail. <strong>Agents do not fail at the end of the trajectory - they fail mid-trajectory</strong>.</p><h2>3. <a href="https://www.linkedin.com/posts/rbelkin_entropy-adaptive-fine-tuning-resolving-confident-activity-7416937560096161792--icu?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAAPDpcBcLu8_J9WkZjSGCZolugTdGmprHo">Entropy-Adaptive Fine-Tuning</a>: a practical take on &#8220;don&#8217;t forget&#8221;</h2><p>Supervised fine-tuning is still the workhorse for specialization - and &#8220;catastrophic&#8221; (or I would rather say &#8220;annoying&#8221;) forgetting is still the bill we pay.</p><p>This paper&#8217;s framing is crisp: it contrasts SFT with on-policy RL and argues the gap comes from distribution mismatch. In RL, the model&#8217;s learning signal is more consistent with its internal beliefs; in SFT, the model is forced to fit external supervision even when that conflicts sharply with what it &#8216;knows.&#8217;</p><p>They focus on confident conflicts: cases where the label token is low-probability under the model, while the model&#8217;s distribution is low entropy (i.e., it is confidently predicting something else). That is where gradients get destructive.</p><p>Their proposal, Entropy-Adaptive Fine-Tuning (EAFT), uses token-level entropy as a gating mechanism: learn aggressively when the model is uncertain; suppress gradients when the model is confident-but-disagreeing.</p><p>From my most recent post: I also think EAFT is a genuinely useful alternative to LoRA in the &#8216;don&#8217;t wreck the base model&#8217; sense - rather than constraining where we update (parameter-efficient adapters), EAFT constrains when updates should matter (skip the destructive ones).</p><p><strong>Why we should care:</strong></p><ul><li><p>This is the kind of idea that turns continuous updates from scary to feasible.</p></li><li><p>It maps to a real production vibe: most of the time we want to learn; sometimes we want to refuse the lesson.</p></li><li><p>It is a different safety knob than LoRA - but it is targeting the same anxiety: regressions.</p></li></ul><h2>4. <a href="https://www.linkedin.com/posts/rbelkin_fig1-activity-7415183051556564993-ws2d?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAAPDpcBcLu8_J9WkZjSGCZolugTdGmprHo">From Entropy to Epiplexity</a>: measuring &#8220;useful information&#8221; for bounded learners</h2><p>Data quality is still the hidden kingmaker. The hard part is: we are not data-rich, we are signal-poor.</p><p>This paper asks a deceptively simple question: can we quantify learnable content in data without tying it to a downstream task?</p><p>They argue classic information measures (Shannon entropy, Kolmogorov complexity) do not capture what matters for computationally bounded learners, and they propose a new measure: epiplexity.</p><p>The vibe: epiplexity is meant to capture structural content while excluding &#8216;time-bounded entropy&#8217; (random/unpredictable content), and the authors claim it helps explain why deterministic transformations and data ordering can still create useful learnable structure in practice.</p><p><strong>Why we should care:</strong></p><ul><li><p>If inputs are becoming the product, we eventually want a metric for the informational value of inputs.</p></li><li><p>Epiplexity feels like a step toward data selection as an engineering discipline, not an art project.</p></li></ul><h2>5. <a href="https://www.linkedin.com/posts/rbelkin_llada20techreportpdf-at-main-inclusionai-activity-7406820279944982528-DmUi?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAAPDpcBcLu8_J9WkZjSGCZolugTdGmprHo">LLaDA2.0</a>: diffusion language models to 100B</h2><p>Autoregressive decoding is powerful, but it is fundamentally serial.</p><p>LLaDA2.0 pushes discrete diffusion language models to 100B parameters via a conversion process: take a pretrained AR model and convert it to a dLLM using a 3-phase block-level training scheme (warm-up with increasing block size, stable full-sequence diffusion, decay back to compact block diffusion).</p><p>They also discuss post-training alignment with SFT and DPO, framing this as a path to frontier-scale efficiency while preserving parallel decoding advantages.</p><p><strong>Why we should care:</strong></p><ul><li><p>Parallel decoding is not just a speed story. It changes how we can spend compute at inference time.</p></li></ul><p>Faster sampling = more room for verification, search, and self-checking within real latency budgets.</p><h2>6. <a href="https://arxiv.org/abs/2509.10534">PoPE</a>: decoupling the &#8220;what&#8221; and &#8220;where&#8221; in positional embeddings</h2><p>I have a soft spot for papers that say: &#8216;this popular thing is entangled in a way that quietly hurts you,&#8217; and then fix it cleanly.</p><p>PoPE (Polar Coordinate Positional Embeddings) argues RoPE entangles content (what) and position (where), which can impair tasks requiring independent matching on the two. They propose PoPE to remove the confound, show better performance on diagnostics and across sequence modeling domains, and highlight strong zero-shot length extrapolation vs RoPE - and even vs YaRN.</p><p><strong>Why we should care:</strong></p><ul><li><p>Long context is now table stakes for serious agent workflows.</p></li><li><p>Works at 8k is not the same as behaves at 80k.</p></li></ul><h1>DeepSeek + stuff that ships</h1><h2><a href="https://www.linkedin.com/posts/rbelkin_engramengrampaperpdf-at-main-deepseek-ai-activity-7416602221993172992-rYqb?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAAPDpcBcLu8_J9WkZjSGCZolugTdGmprHo">Engram</a>: conditional memory as a second sparsity axis</h2><p>The Whale does it again. MoE gave us conditional computation. But knowledge lookup is still mostly simulated via dense compute.</p><p>Engram proposes conditional memory - a complementary axis of sparsity - implemented via an O(1) lookup module modernizing classic N-gram embeddings.</p><p>They describe a &#8216;Sparsity Allocation&#8217; tradeoff between neural computation (MoE) and static memory (Engram), claim a U-shaped scaling law, and report scaling Engram to 27B parameters with gains not just on knowledge tasks but also reasoning, code/math, and long-context retrieval.</p><h2><a href="https://www.linkedin.com/posts/rbelkin_mhc-manifold-constrained-hyper-connections-activity-7412530889903112192-8oM6?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAAPDpcBcLu8_J9WkZjSGCZolugTdGmprHo">mHC</a>: Manifold-Constrained Hyper-Connections</h2><p>This zooms in on a real training pathology: expanding residual streams/connectivity can improve performance, but it can also break the identity mapping property residual connections rely on - leading to instability and scalability issues.</p><p>mHC proposes projecting the residual connection space onto a manifold to restore identity mapping while keeping things efficient.</p><h2><a href="https://www.linkedin.com/posts/rbelkin_ppovsgrpo-activity-7415559502030241792-UhoE?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAAPDpcBcLu8_J9WkZjSGCZolugTdGmprHo">DeepSeek-R1</a>: RL-first reasoning + the GRPO refresher</h2><p>Even if you are excited about the next base model drop, R1&#8217;s training recipe is the more durable lesson.</p><p>The core claim: reasoning behaviors can emerge via pure RL (with a cold-start SFT phase for readability/stability), and they lean on GRPO - which is worth revisiting if your PPO mental model is rusty.</p><p>Quick intuition: GRPO drops the critic and estimates a baseline from grouped samples, which matters a lot for scaling RL in LLM land.</p><h2><a href="https://www.linkedin.com/posts/rbelkin_solving-llm-repetition-problem-in-production-activity-7405340339450023936-p2ve?utm_source=share&amp;utm_medium=member_desktop&amp;rcm=ACoAAAAPDpcBcLu8_J9WkZjSGCZolugTdGmprHo">Solving LLM repetition</a> in production</h2><p>This one earns points for being unapologetically real: repetition loops that stall batch tasks.</p><p>They identify repetition patterns, frame the root cause via Markov analysis + greedy decoding getting stuck in loops, and evaluate mitigations: beam search with early_stopping=True (universal post-hoc), presence_penalty (case-specific), and DPO fine-tuning (model-level universal).</p><h1>The connective tissue</h1><p>If I had to summarize the direction across these papers in one line: we are entering the era of systems that add knobs: inference-time recursion for extreme context, token-level steering, entropy-gated learning, explicit memory, better information measures, disentangled position representations, and faster decoding.</p><p>The fun part: <strong>these knobs compound.</strong></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Notes on Product Development Process?]]></title><description><![CDATA[&#8220;When a number of people concentrate on a single thought, they can compel the world to accept it.&#8221; &#8212; Paramahansa YoganandaThanks for reading The Equation!]]></description><link>https://www.equationblog.com/p/what-is-your-product-development</link><guid isPermaLink="false">https://www.equationblog.com/p/what-is-your-product-development</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Fri, 21 Nov 2025 23:15:14 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-MQk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>&#8220;When a number of people concentrate on a single thought, they can compel the world to accept it.&#8221; &#8212; Paramahansa Yogananda</em></p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p><strong>&#8220;What is your Product Development Process?&#8221;</strong> -  I&#8217;ve been asked this question several times lately&#8212;including during the <a href="https://elc.community/public/podcasts/specs-as-the-new-source-of-truth-synthetic-data-as-the-next-wave-of-defensibility-product-vision-and-decision-making-frameworks-ruslan-belkini-inflection-ai">ELC podcast</a>.</p><p>I&#8217;ve always been an engineering person. Systems, infrastructure, APIs&#8212;that&#8217;s the way my brain was wired. My interest in product and users started early at LinkedIn. Perhaps I was influenced by my product partner there - <a href="https://www.linkedin.com/in/ablue/">Allen Blue</a>, who was one of the co-founders and amazingly forward looking product visionary. Somewhere in those early cycles it clicked: we do what we do <strong>in service of our users</strong>&#8212;whether those users are developers, end users, or enterprise customers. That realization pulled me from &#8220;just engineering&#8221; into caring about product outcomes. And yet, at the start, I was a complete novice at product (and truthfully probably still am).</p><p>I read the usual suspects (<em><a href="https://www.amazon.com/Lean-Startup-Entrepreneurs-Continuous-Innovation/dp/0307887898">The Lean Startup</a></em>, <em><a href="https://www.amazon.com/Competing-Against-Luck-audiobook/dp/B01IIGPYGC/ref=sr_1_1?crid=2TDML7IMO44ZZ&amp;dib=eyJ2IjoiMSJ9.STwdk378XLZZKzi3XO7n3bZv36x8ZV9v4rW6VEizg5O0ruWENvbsbx8QiiyxXIlYTFATOvJjFYK1kqCrddUrfhG84-yehOvdGzR1lZoTtR5O9_IzmvA6Tfh58QwySM3DpS_KXZOM62jk_ZiAhRrp0S1C6smFUlLBLSuYKpmUKBboEJs1dbZaPld1myGGk8kVfQpEGcHs3DSuNG3HyWVpP1OQUNBfAAQnk0rR7UtjXik.Cz7aTIho21sm5gDrWj4AWkGpUxibmihK2OcR9cmg5m4&amp;dib_tag=se&amp;keywords=Competing+Against+Luck&amp;qid=1761935104&amp;s=books&amp;sprefix=competing+against+luck%2Cstripbooks%2C148&amp;sr=1-1">Competing Against Luck</a></em>, <em><a href="https://www.amazon.com/The-Mom-Test-Rob-Fitzpatrick-audiobook/dp/B07RJZKZ7F/ref=sr_1_1?crid=5SKS7YOVXNPK&amp;dib=eyJ2IjoiMSJ9.-qOPHHPOaY7mEvJQiz1-_SRx_FI3lGO9aA0dhokR7ms.nredWviFhEtciQULac0jny0ttHAiprVQARIcx5uNqEA&amp;dib_tag=se&amp;keywords=The+Mom+Test&amp;qid=1761935127&amp;s=audible&amp;sprefix=the+mom+test%2Caudible%2C142&amp;sr=1-1">The Mom Test</a></em>, and a mountain of blogs) about experiments, rapid iteration, speed, listening to customers, etc. Yet the truth is that a lot of product and company building is about luck&#8212;and if you play the lottery long enough, sometimes you win and the technique was perhaps less important than professed. At some point I came to a realization: there is no such thing as &#8220;finding&#8221; product&#8209;market fit beyond random luck. There is only <strong>compelling the world</strong> to accept your vision of the future.</p><p>So, here&#8217;s how I think abut the product product development process today:</p><h2>1. Imagine the future</h2><p>Think of yourself as a sci&#8209;fi <strong>writer</strong>. Imagine the future you want to see. In detail and with maximum nuance. Feel it, smell it, live it, write about it (for yourself mostly). This is not entirely a logical process. Works of art don&#8217;t come from the mind alone; they come from the soul. The more exact, detailed, and precise the imagination&#8212;the better. The approach works not only for big ideas, but also for small problems, design decisions and similar challenges. Try it. Obviously, you need a background in the field so your imagination has grounding.</p><p><strong>Where this can go wrong:</strong></p><ul><li><p><strong>The idea isn&#8217;t yours.</strong> Works of art are not a collective exercise. We often adopt other people&#8217;s ideas unconsciously&#8212;we are imitators by design (see Ren&#233; Girard&#8217;s <em>mimetic</em> theory). Your founding team can absolutely enhance and improve the idea&#8212;just don&#8217;t confuse where it came from. Maybe the product idea isn&#8217;t yours, but you came up with a technological solution that makes it work&#8212;that&#8217;s OK, join the team. The worst failure mode is convincing yourself you like someone else&#8217;s idea on logical merits when you&#8217;re not passionate about it. If you&#8217;re going to be a mercenary, be honest about it&#8212;and love the craft of being a mercenary.</p></li><li><p><strong>The idea isn&#8217;t well envisioned.</strong> It&#8217;s a fleeting, fuzzy thought. The world can&#8217;t materialize a fuzzy image. This failure mode often happens to non-technical people or when we simply don&#8217;t yet have a necessary background to be able to imagine the end product in a precise enough detail.</p></li><li><p><strong>It&#8217;s purely cerebral.</strong> A product born only from logic usually isn&#8217;t art. You need conviction that lives below the neck.</p></li></ul><h2>2. Test it</h2><p>Pitch it to your smartest friends, as well as to friends not in the field. Stress&#8209;test it from different angles. Run experiments to <strong>prove or disprove</strong> your thinking. Is the idea actually new in the first place or you just didn&#8217;t bother to do basic research? Think like an investor; evaluate it as you would an investment pitch. This is the logical part. Is the technology feasible? Is the timeframe right? You have to allow some uncertainty, perhaps a lot of uncertainty. You will hear many &#8220;no&#8221;s and there will never be enough data. But there should be a threshold of probability you can settle on&#8212;and then decide.</p><p><strong>Where this can go wrong:</strong></p><ul><li><p><strong>No serious scrutiny.</strong> Less of an issue with startups, but a very common failure mode in large companies. Apply VC&#8209;type scrutiny: explicit assumptions (why, why now, why this team, how is it different, what category are we creating or capturing, how do we solve the distribution). Even for small projects - do this, it can be done very quickly with modern tools.</p></li><li><p><strong>Never&#8209;ending search for 100% conviction.</strong> It doesn&#8217;t exist. Set the risk level you can carry and move.</p></li></ul><h2>3. Focus on it</h2><p>This is about <strong>compelling the world</strong> to accept your vision. It requires assembling as many smart, values&#8209;aligned people as you can and maintaining persistent focus on execution. This is also the place where much traditional advice can work&#8212;to an extent.</p><p><strong>Where this can go wrong:</strong></p><ul><li><p><strong>Insufficient critical mass of smart people</strong>, or settling for the wrong, mediocre team (or a team of mercenaries) that isn&#8217;t excited or hasn&#8217;t adopted the idea almost as a new religion. I have yet to hear from anyone that they have a mediocre team (&#8220;we have a very high bar clich&#233;&#8221;). Be honest, is it really true? Most teams in fact are mediocre or worse.</p></li><li><p><strong>Insufficient funding. </strong>It is perhaps a sad truth - but funding is a competitive differentiation for startups. Inadequate capital could impair both the ambition and execution when the speed matters most.</p></li><li><p><strong>Loss of focus</strong> during execution. Could happen for variety of reasons - loss of conviction on a part of the team, poor growth execution / team dilution (by hiring too many or wrong people too fast), sometimes a function of too much funding.</p></li><li><p><strong>Bailing too early</strong>&#8212;not giving the idea enough time to work. This one is hard as you never know for sure.</p></li></ul><div><hr></div><p>I&#8217;m writing this for myself&#8212;as a reminder&#8212;as much as for you. It&#8217;s incredibly easy to fall into these traps.</p><p><strong>Imagine clearly. Test honestly. Focus completely.<br>Compel the world.</strong></p><p>Love to hear your thoughts.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Agents That Feel the Room—and Fix Themselves]]></title><description><![CDATA[Where agentic AI is headed, and why intelligent data flywheels matter]]></description><link>https://www.equationblog.com/p/agents-that-feel-the-roomand-fix</link><guid isPermaLink="false">https://www.equationblog.com/p/agents-that-feel-the-roomand-fix</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sat, 25 Oct 2025 14:15:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-MQk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>If you watch today&#8217;s agents for more than a few minutes, you see the mood swings. In one chat they&#8217;re warm and crisp; in the next they over&#8209;explain, miss an obvious cue, or bungle a tool call with the wrong parameter. And even when there&#8217;s no &#8220;human feel&#8221; at all&#8212;just a heads down task&#8212;they drift off spec: skip required steps, optimize the wrong objective, or simply neglect or misinterpret human inputs. </p><p>I don&#8217;t think this is a &#8220;just scale it&#8221; problem. It&#8217;s a feedback problem. We&#8217;re still treating behavior&#8212;tone, choices, task adherence, even API hygiene&#8212;like vibes in a prompt instead of something we can observe, grade, and steadily improve. That&#8217;s why I keep coming back to intelligent data flywheels: small, concrete loops that turn context &#8596; behavior trace into a living artifact for training, evaluation, and live steering.</p><h3>What I mean by &#8220;intelligent data flywheels&#8221;</h3><p>The picture in my head is simple. Keep a <strong>map</strong> of the situations your agent encounters (support vs. sales, calm vs. upset, low&#8209;stakes vs. high&#8209;stakes) and the <strong>behaviors</strong> you want in each (clarity, empathy, brevity, caution, brand voice and values). Use that map to (a) <strong>generate</strong> realistic multi&#8209;turn data&#8212;by having a &#8220;Human LLM&#8221; act out the human side of conversations&#8212;and (b) <strong>judge</strong> the agent against the behavior playbook with a grader that you also train. Then run the same primitives at three speeds: <em><strong>observability</strong></em> (find where it breaks or excel), <em><strong>inference&#8209;time nudging</strong></em> (steer it right now), and <em><strong>training</strong></em> (make the fix stick). When human raters and the judge disagree beyond a reasonable margin, you update the judge; when they align, you let the judge carry more load and feed the next enhancement round. <strong>That&#8217;s the loop</strong>.</p><p>I still care a lot about <em>emotional intelligence</em>&#8212;human-facing agents that communicate across different media and modalities&#8212;but the same flywheel helps with boring, high&#8209;impact stuff that isn&#8217;t &#8220;EQ&#8221; at all: tool use, retrieval grounding, latency/cost tradeoffs, and safety drift.</p><h3>Why this suddenly feels practical</h3><p>A few research threads clicked into place:</p><ul><li><p><strong>Principle&#8209;following reward models</strong> showed you can align behavior to a collection of human-written rubrics instead of massive preference sets</p></li><li><p><strong>Inference&#8209;time scaling for judges</strong> matured (e.g. DeepSeek&#8217;s GRM). Spend more tests&#8209;time compute&#8212;parallel samples plus a meta&#8209;judge&#8212;and you get more reliable reward signals for both training and live guardrails.</p></li><li><p><strong>Judges that actually reason</strong> generate a case-specific rubric before scoring, which humans can verify, align, and generalize..</p></li><li><p><strong>Open, strong preference data</strong> (e.g. helpsteer3) and <strong>closed, experiential data from your deployed agent</strong> provides a solid base you can specialize with your own behavior map.</p></li></ul><p><strong>Caveat</strong>: LLM&#8209;as&#8209;a&#8209;judge isn&#8217;t magic. Benchmarks like JudgeBench show judges can be brittle or biased if you don&#8217;t treat them like first&#8209;class products&#8212;versioned, monitored, retrained, and contexted.. That&#8217;s another reason to put them inside the flywheel.</p><h3>The unglamorous stuff agents fail at</h3><ul><li><p><strong>Function calling &amp; schema correctness.</strong> Even top models still fumble basic format rules (quote this string; ISO date there) and multi&#8209;step tool chains. Recent work&#8212;BFCL, JSONSchemaBench, IFEval&#8209;FC&#8212;quantifies how often calls are syntactically valid yet semantically wrong. In my head, the &#8220;judge&#8221; can be a schema/trace checker with scenario&#8209;aware penalties, and the generator can synthesize tricky, long&#8209;horizon tool graphs to close the gap.</p></li><li><p><strong>Grounding &amp; hallucinations in RAG.</strong> Datasets like RAGTruth and newer lenses like HalluLens keep reminding us that extrinsic hallucinations haven&#8217;t vanished; high&#8209;certainty hallucinations are especially sneaky. A flywheel can grade answers on entailment against retrieved context and choose the next hard cases to label or synthesize.</p></li><li><p><strong>Open&#8209;world task reliability.</strong> Real agent work looks like OS&#8209;level workflows and the messy web. OSWorld, WebArena, and AgentBench have moved the bar here and highlight recurring failure modes&#8212;state tracking, planning depth, visual grounding. Using their task taxonomies as &#8220;contexts&#8221; and step&#8209;level success as &#8220;behaviors&#8221; gives you a clean contract for the flywheel to optimize.</p></li><li><p><strong>Safety and the &#8220;persona dial.&#8221;</strong> OpenAI&#8217;s emergent&#8209;misalignment results point to interpretable <em>persona features</em>&#8212;directions in activation space that modulate toxic or deceptive modes. That turns safety from a black box into a dial you can monitor and counter&#8209;steer inside the flywheel.</p></li></ul><p>And yes, <strong>EQ still matters</strong> because humans are in the loop. Benchmarks like EmoBench, EmotionQueen, and multimodal EmoBench&#8209;M show a persistent gap to humans on &#8220;understand + respond appropriately.&#8221; That&#8217;s the sweet spot for a behavior&#8209;by&#8209;context map coupled to a judge that also reasons about emotion.</p><h3>How this looks in practice&#8212;my mental movie</h3><p>I picture an analytics view that doesn&#8217;t just say &#8220;CSAT dropped,&#8221; but <em>where and why</em>: &#8220;In escalations from upset users, brevity overrode clarity; schema errors spiked in step&#8209;3 tool calls; the judge drifted on empathy.&#8221; From there, the loop suggests more of what it needs: a batch of synthetic escalations with complicated tool chains; a judge tune on Emotion&#8209;Application items; a tweak to the behavior weights for this scenario. We close the loop in three places: surface the issue (observability), compensate now (inference), and make it permanent (training). Publish the change, re&#8209;benchmark, repeat. </p><p>Over time you get a system that not only <em>thinks</em> better, but <em>behaves</em> predictably under stress&#8212;because behavior stopped being vibes and started being data.</p><p>And one more thing: over time, those results compound into a differentiated, market-adapting playbook &#8212; a living operational memory co-built by your team and the system, shaped by every success, failure, and fix.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p>]]></content:encoded></item><item><title><![CDATA[Is code still the source of truth?]]></title><description><![CDATA[Summary of my presentation at 2025 ELC Annual]]></description><link>https://www.equationblog.com/p/is-code-still-the-source-of-truth</link><guid isPermaLink="false">https://www.equationblog.com/p/is-code-still-the-source-of-truth</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Wed, 15 Oct 2025 14:15:48 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!37ed!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>I argued at <a href="https://youtu.be/y6M-SCOhnfE?si=Q5_LBSFsRbrGqoqj">ELC Annual</a> that the center of gravity is drifting&#8212;from code to <em>inputs about the code</em>. Prompts, datasets, models, and evals are becoming the primary artifacts. That&#8217;s not a prediction; it&#8217;s merely connecting the dots <em>backward</em> and noticing what&#8217;s already changed.</p><div class="native-video-embed" data-component-name="VideoPlaceholder" data-attrs="{&quot;mediaUploadId&quot;:&quot;3b5bef46-7b41-4c50-a547-16223e09c54f&quot;,&quot;duration&quot;:null}"></div><div><hr></div><h3>The constraints we actually fight</h3><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Most of engineering is just pressure-testing three constraints: <strong>human cognition</strong> (complexity, coordination, bugs), <strong>provisioning at scale</strong> (deploy, redundancy, cost), and <strong>change risk</strong> (feedback loops). The tools that moved the needle before LLMs&#8212;managed runtimes, DVCS+CI/CD, containerization/cloud, observability&#8212;were all bets against those constraints.</p><div><hr></div><h3>What shifted with LLMs</h3><p>Treating the model like a &#8220;compiler&#8221; works&#8212;until it doesn&#8217;t. Traditional compilers are deterministic; auto&#8209;regressive models aren&#8217;t. Tiny prompt edits (or swaps between model families) yield materially different outcomes. That puts <em>inputs</em>&#8212;prompts, retrieval/context rules, tool schemas&#8212;and <em>evaluation</em> at the center. The inputs are the product.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!37ed!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!37ed!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png 424w, https://substackcdn.com/image/fetch/$s_!37ed!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png 848w, https://substackcdn.com/image/fetch/$s_!37ed!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png 1272w, https://substackcdn.com/image/fetch/$s_!37ed!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!37ed!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png" width="1456" height="1018" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1018,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:177727,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://www.equationblog.com/i/175477719?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!37ed!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png 424w, https://substackcdn.com/image/fetch/$s_!37ed!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png 848w, https://substackcdn.com/image/fetch/$s_!37ed!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png 1272w, https://substackcdn.com/image/fetch/$s_!37ed!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe693e6-8ace-42a5-954f-7eeb50a5a876_1617x1131.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>Two practical implications:</p><ol><li><p><strong>Inputs as first&#8209;class artifacts.</strong><br>Prompts (and flows), input datasets (docs, tickets, logs), models/fine&#8209;tunes, and eval suites/user simulators all need real versioning, lineage, and regression checks&#8212;because models change under your feet.</p></li><li><p><strong>Evaluation is your safety rail.</strong><br>You won&#8217;t guarantee determinism; you <em>can</em> bound behavior and catch drift. Invest in evals and forward simulation before you put agents anywhere near money or production.</p></li></ol><div><hr></div><h3>Legacy, repos&#8212;and the new &#8220;rewrite&#8221;</h3><p>&#8220;Full rewrite&#8221; used to be a dirty word. With LLM&#8209;accelerated throughput, wholesale rewrites are increasingly viable when entropy makes understanding costlier than regeneration + hardening with evals. </p><div><hr></div><h3>What engineering is becoming</h3><p>Zoom out and the job collapses into <strong>tooling</strong> and <strong>data</strong>.</p><p>Humans become what the models aren&#8217;t:  <strong>(a) evaluators</strong> and <strong>(b) carriers of undocumented institutional knowledge</strong>. That&#8217;s where leverage lives.</p><blockquote><p><strong>Tech debt is dirty data.</strong><br>Clean it, or it compounds.</p></blockquote><h3>A minimal operating checklist (until better tooling is available):</h3><ul><li><p><strong>Check in prompts with code.</strong> Include tool schemas, context rules, guardrails, and tests.</p></li><li><p><strong>Pin and record models.</strong> Track families/versions and fine&#8209;tuning metadata like compiler flags. Expect drift.</p></li><li><p><strong>Build evals before features.</strong> Scenarios, simulators, acceptance criteria&#8212;gate releases on them.</p></li><li><p><strong>Prefer rewrite when entropy wins.</strong> If understanding cost &gt; regeneration + eval hardening, start over.</p></li><li><p><strong>Instrument everything.</strong> You can&#8217;t lead probabilistic systems blind.</p></li></ul><p>None of this requires prophecy (Bohr and Feynman would approve);  tempered by the reminder that auto&#8209;regressive generation diverges without control. </p><p>The link to the full <a href="https://youtu.be/y6M-SCOhnfE?si=kpKoXJiL-BxIDwpE">video</a></p><p>&#8212; <strong>Ruslan</strong></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Updated workout routine - Part 2/2]]></title><description><![CDATA[6-weeks functional block]]></description><link>https://www.equationblog.com/p/updated-workout-routine-part-22</link><guid isPermaLink="false">https://www.equationblog.com/p/updated-workout-routine-part-22</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sat, 11 Oct 2025 14:15:37 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-MQk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>&#8220;I am good at pullups&#8221;</em>  &#8212;Tony Horton</p><div><hr></div><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>As mentioned in the previous post - I&#8217;ve recently tweaked my workout schedule to harmonize the functional block (focused on full body power movements) and weight lifting block (focused on muscle building) to make them both be 6-weeks in length each. I alternate those blocks.</p><p>In my <a href="https://www.equationblog.com/p/my-current-workout-routine">previous</a> post I described the prior routine: two blocks &#8212; a 6-week full-body (mostly power-focused) functional block, followed by a 4-week weight-lifting block.</p><p>In this post, I&#8217;m outlining a modified functional block. This is the hardest block due to high demand on neuro-musculature system (even though the weights are considerably lower) and the follow on weight lifting block always feels like a break.</p><p>As before, I work out first thing in the morning (work scheduling), with a 5:30 am wake-up, quick oral/face hygiene, pre-workout <a href="https://www.equationblog.com/p/the-supplements">supplements</a>, and a 10-15 min meditation. </p><p>Because the blocks are now a bit longer, I&#8217;m skipping pre-workout foam rolling during the work week. If I had unlimited time, I&#8217;d keep it.</p><p>I still use guided workouts (<a href="https://www.bodi.com/us/en/s/subscription?code=SEMB_BODI_GOOGLE&amp;g_acctid=821-296-1770&amp;g_adgroupid=166824054903&amp;g_adid=733099013197&amp;g_adtype=search&amp;g_campaign=SEM_Brand_All&amp;g_campaignid=21755424360&amp;g_keyword=beachbody&amp;g_keywordid=kwd-44858520&amp;g_network=g&amp;gad_campaignid=21755424360&amp;gad_source=1&amp;gclid=CjwKCAjwkvbEBhApEiwAKUz6-2h6-q4czV8nmyhecXe2sn_CuI1tjw6RDjhU3QcsCBlzqMjexYOB5BoCG38QAvD_BwE&amp;gclsrc=aw.ds&amp;nb_adtype=&amp;nb_ap=&amp;nb_fii=&amp;nb_kwd=beachbody&amp;nb_li_ms=&amp;nb_lp_ms=&amp;nb_mi=&amp;nb_mt=e&amp;nb_pc=&amp;nb_pi=&amp;nb_placement=&amp;nb_ppi=&amp;nb_si=%7Bsourceid%7D&amp;nb_ti=kwd-44858520&amp;nbt=nb%3Aadwords%3Ag%3A21755424360%3A166824054903%3A733099013197&amp;utm_campaign=SEM_Brand_All&amp;utm_content=21755424360_166824054903&amp;utm_medium=SEM&amp;utm_source=GOOGLE&amp;utm_term=beachbody_">Beachbody</a> &#8212; historical) mainly for timing and pacing.</p><p>The core workout is built around <a href="https://www.beachbodyondemand.com/programs/6-weeks-of-the-work/start-here?locale=en_US">The 6-weeks of the work</a> program by Amolia Ceasar and it consists of 3 blocks, 2 weeks each. The workouts get progressively more complex (although not necessarily harder) in each subsequent block. Since the program has a formal rest day and a really easy 20-25m stretch routine day, I simply skip the rest day and tag on the stretching routine to the tail end of the last workout of the week. So the program&#8217;s 2 weeks end up being 10 days for me. To make it still be 6 calendar weeks I simply repeat week 1 or week 5 in the end (week 1 is actually one of the hardest weeks if you push the weights up). For reference - I am sharing my workout sheets <a href="https://docs.google.com/document/d/15WEBWvQ095jwGwg1DS_jHmzMD2OrZr9DgTgV7P7VjFA/edit?usp=sharing">here</a>.</p><p>Below are two blocks each matching alternating week in the program.</p><p>Weeks 1/3/5:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{array}{c c c c}\n\\textbf{Core workout} &amp; \\textbf{Supplemental 1} &amp; \\textbf{Supplemental 2} &amp; \\textbf{Supplemental 3} \\\\\n\\hline\nPush &amp; 10m\\ Ab\\ Hummer  &amp; Short\\ Intervals  &amp; 30m \\ Zone\\ 2 \\\\\nLegs &amp; P90X3\\ Challenge  &amp; 30m \\ Zone\\ 2 \\\\\nEndurance\\ \\&amp;\\ Agility &amp; P90X\\ Ab\\ Ripper  &amp; 50m \\ Zone\\ 2 \\\\\nPull &amp; IM30\\ 10m\\ Abs &amp; Long\\ Intervals &amp; 30m \\ Zone\\ 2 \\\\\nFull\\ Body\\ Tempo &amp; Range\\ \\&amp;\\ Repair &amp; 50m \\ Zone\\ 2 \\\\\n\n\\end{array}&quot;,&quot;id&quot;:&quot;GGMEAQIFIO&quot;}" data-component-name="LatexBlockToDOM"></div><p>Weeks 2/4/6:</p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{array}{c c c c}\n\\textbf{Core workout} &amp; \\textbf{Supplemental 1} &amp; \\textbf{Supplemental 2} &amp; \\textbf{Supplemental 3} \\\\\n\\hline\nTotal\\ Body\\ Push\\ Pull &amp; 10m\\ Ab\\ Hummer  &amp; Short\\ Intervals  &amp; 30m \\ Zone\\ 2 \\\\\nStrength\\ \\&amp;\\ Power  &amp; 50m \\ Zone\\ 2 \\\\\nCardio\\ \\&amp;\\ Core &amp; P90X3\\ Challenge  &amp; 30m \\ Zone\\ 2 \\\\\nIsometrics &amp; IM30\\ 10m\\ Abs &amp; Long\\ Intervals &amp; 30m \\ Zone\\ 2 \\\\\nCrucible &amp; Range\\ \\&amp;\\ Repair &amp; 50m \\ Zone\\ 2 \\\\\n\n\\end{array}&quot;,&quot;id&quot;:&quot;LJIUQCJXIC&quot;}" data-component-name="LatexBlockToDOM"></div><p></p><ul><li><p><a href="https://www.beachbodyondemand.com/programs/p90x3/start-here?locale=en_US">P90X3</a> Challenge consists of four blocks. Each block has two sets of pull-ups followed by push-ups (different variations). My current numbers are: 15 pull-ups, 40 push-ups per set. That&#8217;s 120 pull-ups and 320 push-ups per workout. There&#8217;s also a burnout at the end &#8212; 2 pull-ups and 4 push-ups, no rest, for 6 sets &#8212; so add 12 pull-ups and 24 push-ups to the total.</p></li><li><p>10min Ab Hummer is from <a href="https://www.beachbodyondemand.com/programs/the-masters-hammer-and-chisel/start-here?locale=en_US">Hammer and Chisel</a> program, P90X Ab Ripper is from <a href="https://www.bodi.com/us/en/s/fitness/p90x?code=Shopping_BOD_SMART_GOOGLE&amp;utm_campaign=BB_US_PLA_GGL_PerformanceMax_ProgramsBR&amp;utm_content=21698947640_&amp;utm_term=&amp;g_acctid=821-296-1770&amp;g_adgroupid=&amp;g_adid=&amp;g_adtype=none&amp;g_campaign=BB_US_PLA_GGL_PerformanceMax_ProgramsBR&amp;g_campaignid=21698947640&amp;g_keyword=&amp;g_keywordid=&amp;g_network=x&amp;gclsrc=aw.ds&amp;nb_adtype=&amp;nb_ap=&amp;nb_fii=&amp;nb_kwd=&amp;nb_li_ms=&amp;nb_lp_ms=&amp;nb_mi=&amp;nb_mt=&amp;nb_pc=&amp;nb_pi=&amp;nb_placement=&amp;nb_ppi=&amp;nb_si={sourceid}&amp;nb_ti=&amp;nbt=nb%3Aadwords%3Ax%3A21698947640%3A%3A&amp;gad_source=1&amp;gad_campaignid=21692483691&amp;gclid=CjwKCAjwuePGBhBZEiwAIGCVS_Gq1bOqaavHimcR4HvsA7h1UhD3I5tu3ZGynnrRMKpyVGTYCBIhpRoCZmoQAvD_BwE">P90X</a> and 10min Abs is from <a href="https://www.beachbodyondemand.com/programs/insanity-max30/start-here?locale=en_US">Insanity Max 30.</a></p></li></ul><p><strong>Intervals</strong></p><ul><li><p><strong>Long intervals:</strong> 3-min warm-up jog; 3 &#215; 5-min runs with 2-min walks in between; cool-down jog/walk.</p></li><li><p><strong>Short intervals:</strong> 3-min warm-up jog; 9 &#215; 1-min max-effort sprints with 1-min walks; cool-down jog/walk.</p></li></ul><ul><li><p><strong>When time is short</strong>, abs get dropped first. If Zone 2 is 50 min, I cut it to 30; if Zone 2 follows intervals, I drop it.</p></li></ul><p>The form is critical for these workouts, especially mental fatigue sets in. Seemingly lower weights can be deceptive - do not raise the weights unless you can complete the exercise with nearly a perfect form, as the risk of injury with complex power moves goes up considerably.</p><p><strong>Note</strong>: doing exercises with a perfect form is an easy way to enhance effectiveness  with lower weights and much reduced risk of injury.</p><p><strong>When traveling</strong></p><p>When traveling and the hotel gym with weights is available, both circuits can continue, depending on the space and how crowded the place is. When no gym is available I normally switch to Insanity Max 30 Month Two workouts for the duration. Those are body weight only and while not sufficient by themselves, they are decent enough for maintenance while traveling.</p><p>Again - this is what I do. Consistency and a purposeful plan are keys and there are many other ways to stay in shape.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Updated workout routine - Part 1/2]]></title><description><![CDATA[&#8220;Key to performance: fit the benchmark.&#8221; &#8212;unspoken rule at major AI labs]]></description><link>https://www.equationblog.com/p/updated-workout-routine-part-12</link><guid isPermaLink="false">https://www.equationblog.com/p/updated-workout-routine-part-12</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sat, 23 Aug 2025 14:15:23 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-MQk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>&#8220;Key to performance: fit the benchmark.&#8221;</em>  &#8212;unspoken rule at major AI labs</p><div><hr></div><p><a href="https://my.clevelandclinic.org/health/diagnostics/10683-dexa-dxa-scan-bone-density-test">DEXA</a> Update (August, 2025):</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!r5dI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!r5dI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png 424w, https://substackcdn.com/image/fetch/$s_!r5dI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png 848w, https://substackcdn.com/image/fetch/$s_!r5dI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png 1272w, https://substackcdn.com/image/fetch/$s_!r5dI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!r5dI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png" width="1456" height="171" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:171,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:61973,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.equationblog.com/i/170830684?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!r5dI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png 424w, https://substackcdn.com/image/fetch/$s_!r5dI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png 848w, https://substackcdn.com/image/fetch/$s_!r5dI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png 1272w, https://substackcdn.com/image/fetch/$s_!r5dI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa225d11-33fc-42c9-90da-f82391a9e3d1_1912x224.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><p>As you can see - total body fat went up by 3.3 pounds, while total lean mass went up by 4.4 pounds since the last scan. As you can see - it is actually very difficult to increase lean mass without gaining some fat mass.</p><div><hr></div><p>I&#8217;ve recently tweaked my workout schedule, shifting more toward muscle and <a href="https://en.wikipedia.org/wiki/VO2_max">VO2max</a> maintenance.</p><p>In my <a href="https://www.equationblog.com/p/my-current-workout-routine">previous</a> post I described the prior routine: two blocks &#8212; a 6-week full-body (mostly power-focused) functional block, followed by a 4-week weight-lifting block.</p><p>In this post, I&#8217;m outlining a modified weight-lifting block that&#8217;s now 6 weeks to match the functional power block.</p><p>As before, I work out first thing in the morning (work scheduling), with a 5:30 am wake-up, quick oral/face hygiene, pre-workout <a href="https://www.equationblog.com/p/the-supplements">supplements</a>, and a 10-15 min meditation. </p><p>Because the blocks are a bit longer, I&#8217;m skipping pre-workout foam rolling during the work week. If I had unlimited time, I&#8217;d keep it.</p><p>I still use guided workouts (<a href="https://www.bodi.com/us/en/s/subscription?code=SEMB_BODI_GOOGLE&amp;g_acctid=821-296-1770&amp;g_adgroupid=166824054903&amp;g_adid=733099013197&amp;g_adtype=search&amp;g_campaign=SEM_Brand_All&amp;g_campaignid=21755424360&amp;g_keyword=beachbody&amp;g_keywordid=kwd-44858520&amp;g_network=g&amp;gad_campaignid=21755424360&amp;gad_source=1&amp;gclid=CjwKCAjwkvbEBhApEiwAKUz6-2h6-q4czV8nmyhecXe2sn_CuI1tjw6RDjhU3QcsCBlzqMjexYOB5BoCG38QAvD_BwE&amp;gclsrc=aw.ds&amp;nb_adtype=&amp;nb_ap=&amp;nb_fii=&amp;nb_kwd=beachbody&amp;nb_li_ms=&amp;nb_lp_ms=&amp;nb_mi=&amp;nb_mt=e&amp;nb_pc=&amp;nb_pi=&amp;nb_placement=&amp;nb_ppi=&amp;nb_si=%7Bsourceid%7D&amp;nb_ti=kwd-44858520&amp;nbt=nb%3Aadwords%3Ag%3A21755424360%3A166824054903%3A733099013197&amp;utm_campaign=SEM_Brand_All&amp;utm_content=21755424360_166824054903&amp;utm_medium=SEM&amp;utm_source=GOOGLE&amp;utm_term=beachbody_">Beachbody</a> &#8212; historical) mainly for timing and pacing, and my week starts on <strong>Sundays</strong>.</p><p>The general structure: a <strong>weight-lifting segment followed by a cardio segment</strong> (where I listen to podcasts or other educational audio).</p><p></p><div class="latex-rendered" data-attrs="{&quot;persistentExpression&quot;:&quot;\\begin{array}{c c c c}\n\\textbf{Core workout} &amp; \\textbf{Supplemental 1} &amp; \\textbf{Supplemental 2} &amp; \\textbf{Supplemental 3} \\\\\n\\hline\nBuild\\ Chest\\ \\&amp;\\ Tries &amp; Beast\\ Abs\\ Classic  &amp; Short\\ Intervals  &amp; 30m \\ Zone\\ 2 \\\\\nBuild\\ or\\ Bulk\\ Legs &amp; P90X3\\ Challenge  &amp; 30m \\ Zone\\ 2 \\\\\nBuild\\ Shoulders &amp; Beast\\ Abs  &amp; 50m \\ Zone\\ 2 \\\\\nBuild\\ Back\\ \\&amp;\\ Biceps &amp; 50m \\ Zone\\ 2 \\\\\nBulk\\ Chest &amp; Beast\\ Abs &amp; Long\\ Intervals &amp; 30m \\ Zone\\ 2 \\\\\nBulk\\ Arms &amp; T30\\ Pull &amp; 50m \\ Zone\\ 2 \\\\\nBulk\\ Shoulders &amp; T30\\ Chin\\ ups &amp; IM30\\ 10m\\ Abs&amp; 50m \\ Zone\\ 2 \\\\\n\n\\end{array}&quot;,&quot;id&quot;:&quot;LFBENJWANL&quot;}" data-component-name="LatexBlockToDOM"></div><p></p><ul><li><p>The core workouts are built around <a href="https://www.beachbodyondemand.com/programs/body-beast/start-here?locale=en_US">Body Beast</a> workout with additional components added on.</p></li><li><p><a href="https://www.beachbodyondemand.com/programs/p90x3/start-here?locale=en_US">P90X3</a> Challenge consists of four blocks. Each block has two sets of pull-ups followed by push-ups (different variations). My current numbers are: 15 pull-ups, 40 push-ups per set. That&#8217;s 120 pull-ups and 320 push-ups per workout. There&#8217;s also a burnout at the end &#8212; 2 pull-ups and 4 push-ups, no rest, for 6 sets &#8212; so add 12 pull-ups and 24 push-ups to the total.</p></li><li><p><a href="https://www.beachbodyondemand.com/programs/tough-mudder-t-minus-30/start-here?locale=en_US">T-30 </a>Chin-up and Pull-up routines are short (~12m) 2X chin-up/pull-up ladders followed by hanging leg raises, monkey bar simulation and a burn-out. My ladder is usually 2-4-8-16 (30 total in each ladder).</p></li></ul><p><strong>When time is short</strong>, abs get dropped first. If Zone 2 is 50 min, I cut it to 30; if Zone 2 follows intervals, I drop it.</p><p><strong>Intervals</strong></p><ul><li><p><strong>Long intervals:</strong> 3-min warm-up jog; 3 &#215; 5-min runs with 2-min walks between; cool-down jog/walk.</p></li><li><p><strong>Short intervals:</strong> 3-min warm-up jog; 9 &#215; 1-min max-effort sprints with 1-min walks; cool-down jog/walk.</p></li></ul><p><strong>Notes on the weight-lifting pattern</strong></p><p>Standard sets follow a <strong>light&#8211;medium&#8211;heavy&#8211;drop</strong>-set pattern.</p><ul><li><p><strong>Light set (15 reps):</strong> warm-up for the muscle group and a chance to fine-tune the form.</p></li><li><p><strong>Medium set (&#8776;12 reps):</strong> calibration (heavy but not maximal). Sometimes I skip medium and go straight to a heavy 8-rep set, then a second heavy set after 45&#8211;60 s rest.</p></li><li><p><strong>Heavy set (6&#8211;8 reps):</strong> stop 1&#8211;2 reps shy of failure, immediately followed by a drop set at medium weight for another 6&#8211;8 reps.</p></li></ul><p>There are no true power moves in this block, but I often simulate them by moving fast on the concentric and slowing down on the eccentric (e.g., a rapid pull-up followed by a slow controlled descent).</p><p><strong>Note 1</strong>: doing exercises with a perfect form is an easy way to enhance effectiveness  with lower weights and much reduced risk of injury.</p><p><strong>Note 2</strong>: shoulder upright row is bio mechanically not a great exercise as it puts shoulders into an impingement position (EZ-bar or not), so I replace it with a <strong>dumbbell high pull</strong> as the closest alternative.</p><p>As always&#8212;if the last set is too easy, add a few extra reps; next time, increase the weight. The key to performance is sufficient stimulus.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Why Inverting AI Workflows Is Key to Enterprise Accuracy]]></title><description><![CDATA[How leveraging human judgment and reversing traditional AI workflows can overcome dirty data challenges.]]></description><link>https://www.equationblog.com/p/why-inverting-ai-workflows-is-key</link><guid isPermaLink="false">https://www.equationblog.com/p/why-inverting-ai-workflows-is-key</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sun, 08 Dec 2024 15:15:47 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!SSKm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>In this post, I want to highlight a persistent challenge that often stalls efforts to improve AI accuracy in enterprise environments. While many acknowledge that insufficient data can be a limiting factor&#8212;especially when proprietary information is needed to enhance a general-purpose large language model (LLM)&#8212;the real issue goes beyond mere scarcity. To achieve top-tier domain performance, enterprises may need to fine-tune LLMs so that they internalize specialized knowledge and context. Unlike Retrieval-Augmented Generation (RAG) or prompt-based methods, which can inject additional information at inference time without altering the model&#8217;s parameters, fine-tuning permanently integrates the provided training data into the model&#8217;s internal representations.</p><p>This direct integration makes data quality crucial. Fine-tuning can deliver significant improvements in accuracy and domain alignment, but only if the training data is both reliable and relevant. Unfortunately, many enterprise datasets&#8212;sourced from CRMs, ERPs, and other operational systems&#8212;are incomplete, inconsistent, and lack clear indicators of quality. These repositories often contain &#8220;dirty&#8221; data that, even after cleaning, remains difficult to differentiate in terms of importance or strategic value. Merely connecting an LLM to these data sources, hoping they will serve as a source of refined knowledge, is insufficient. Without a methodology to identify which subsets truly matter, organizations risk overwhelming the model with undifferentiated content, ultimately diluting its domain expertise rather than enhancing it.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>Only with robust frameworks for scoring, filtering, and weighting data can organizations ensure that their fine-tuned LLMs prioritize the most accurate, timely, and contextually relevant information. In the absence of these data differentiation strategies, the promise of fine-tuning remains out of reach, and valuable proprietary knowledge fails to translate into tangible performance gains for the AI model.</p><h2><strong>An Even Bigger Problem: Human Knowledge Gaps</strong></h2><p>Even if enterprise data could be perfectly cleaned, quantified and integrated, it still wouldn&#8217;t guarantee optimal AI-driven decisions. A significant portion of critical knowledge remains locked in human minds&#8212;varied in expertise, judgment, and subjectivity. This disparity is why organizations hold meetings, rely on structured decision-making processes, and depend on human interpretation to bridge data with hopefully sound judgment.</p><p>To overcome this challenge, enterprises must rethink their AI workflows. Rather than AI serving as a co-pilot to human decision-makers, it should take the role of the pilot. In this model, humans shift from being the central drivers of decision-making to acting as data providers or experts who contribute insights only when the AI specifically requests them. The ideal scenario minimizes human intervention, enabling the AI to steer autonomously.</p><p>Extending the aviation analogy, this transition must be continuous. Instead of a static handover, the AI system should operate within a near real-time decision loop, constantly refining its approach based on new information and feedback. In doing so, the enterprise moves toward a state where human judgment is available on demand, but rarely required at the forefront.</p><div><hr></div><h2><strong>A Real-World Example: Customer Service Chatbots</strong></h2><p>Let&#8217;s consider a simple but real-world scenario: a customer service chatbot. Imagine this bot has access to all relevant information&#8212;customer databases, Jira tickets, FAQs, and documented workflows. For simplicity, assume it achieves perfect accuracy in intent detection.</p><p>What failure modes remain?</p><ol><li><p>The data is incomplete (e.g., it doesn&#8217;t know the answer).</p></li><li><p>The data is wrong (e.g., outdated FAQs or incorrect workflows).</p></li></ol><p>In these scenarios, the fallback is often to engage a human. This is illustrative because it highlights what the AI system should ideally do: engage humans strategically when gaps arise.</p><div><hr></div><h3><strong>Addressing Failure Scenarios</strong></h3><p>Two critical questions emerge:</p><ol><li><p><strong>What should happen next after a failure is detected and a human is engaged?</strong></p></li><li><p><strong>How could the failure have been avoided in the first place?</strong></p></li></ol><p>For the first question, the optimal response is to escalate the issue to a knowledgeable human (or multiple humans for high-stakes decisions), supported by the AI system in data lookup and verification. These escalated humans are likely not the same as the first-line responder&#8212;they must have the expertise to guide the system to the best resolution. Think of this as a "pager duty" for data management.</p><p> For the second question, prevention lies in robust simulations. For example, problem queries could be simulated using past customer interactions, exploring various branches of inquiry to proactively identify gaps in data or workflows.</p><p>This approach scales beyond customer service to more complex systems, such as financial modeling, supply chain management, and strategic decision-making.</p><div><hr></div><h2><strong>Scaling the Solution</strong></h2><p>Inverting traditional workflows&#8212;placing AI in the pilot&#8217;s seat and using humans as expert data sources&#8212;can fundamentally reshape enterprise operations. When properly implemented, this approach not only taps into the best of human expertise on demand, but also ensures that AI systems maintain efficiency, consistency, and domain relevance over time. Yet, to realize these benefits, organizations must couple this human-in-the-loop paradigm with robust data-centric strategies, particularly when it comes to fine-tuning AI models. A carefully curated and differentiated dataset ensures that the AI&#8217;s internal parameters remain aligned with business goals and processes and reflective of the most reliable, high-quality information.</p><p>For this approach to succeed, enterprises should:</p><ol><li><p><strong>Automatically Query Human Input:</strong><br>Treat humans as dynamic, on-call domain experts who provide clarifications, judgments, and domain-specific insights exactly when the situation demands. This continuous, as-needed engagement ensures that fine-tuned models incorporate high-value human knowledge without depending too heavily on subjective inputs.</p></li><li><p><strong>Simulate Task Inputs:</strong><br>Integrate continuous simulation runs. By proactively refining data quality and selection, the AI system stays prepared to handle new inputs effectively.</p></li><li><p><strong>Continuously Update Data and Models</strong><br>Implement iterative pipelines that not only refine data selection, weighting, and filtering, but also continuously adjust the model&#8217;s fine-tuning parameters as it encounters new scenarios.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!SSKm!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!SSKm!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!SSKm!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!SSKm!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!SSKm!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!SSKm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:496458,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!SSKm!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!SSKm!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!SSKm!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!SSKm!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff9f8f1cd-667b-4d51-9dc3-4f35c98e1acd_1024x1024.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>Enterprise AI systems can&#8217;t succeed if they&#8217;re treated as mere co-pilots. The future of AI in the enterprise requires rethinking workflows, prioritizing AI as the primary decision-maker, and leveraging humans as on-demand knowledge sources&#8212;brought into the loop only when their judgment or domain expertise is truly needed. This shift in perspective not only demands robust strategies for filtering, scoring, and fine-tuning proprietary data, but also calls for continuous refinement of how organizations manage, interpret, and integrate information into AI models.</p><p>By giving the AI system the reins, enterprises can better address the persistent issues of &#8220;dirty&#8221; data, the hidden complexities of human judgment, and the inherent challenges of operating at scale. The goal is not to diminish human input, but to transform it into a strategic resource that the AI can query as needed. This inversion&#8212;where humans assist AI, rather than the other way around&#8212;represents the logical next step in unlocking the full potential of AI within the future enterprise landscape.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Eating out hacks]]></title><description><![CDATA[Disclaimer: The following is based on my personal experience and is not intended as medical advice.]]></description><link>https://www.equationblog.com/p/eating-out-hacks</link><guid isPermaLink="false">https://www.equationblog.com/p/eating-out-hacks</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sat, 14 Sep 2024 14:15:58 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>Disclaimer:</strong> The following is based on my personal experience and is not intended as medical advice. Everyone's body and health situation are different, and what works for one person may not work for another. Always consult with a healthcare professional or physician before starting or changing any medication, supplement, or diet plan.</p><p>Let&#8217;s face it&#8212;we can&#8217;t always control what we eat, especially if we want to participate in business and social life.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>You eat at work, attend business or family functions, year-end parties, and let&#8217;s not even mention holidays and vacations. The energy balance equation can quickly get out of whack.</p><p>Let&#8217;s first address the topic everyone is talking about: GLP-1 receptor agonists. While I don&#8217;t have personal experience with them (don&#8217;t need it), I can see how they could be beneficial for many people. From what I understand, I don&#8217;t share many of the concerns about them. In biology, there&#8217;s no such thing as a "free lunch"&#8212;there&#8217;s always a risk/reward balance, and in the general case, being overweight and metabolically unhealthy presents a much higher risk.</p><p>GLP-1 agonists, such as <strong><a href="https://www.drugs.com/semaglutide.html">semaglutide</a></strong> (sold under brand names like Ozempic and Wegovy), <strong><a href="https://www.drugs.com/mtm/liraglutide.html">liraglutide</a></strong> (Saxenda), and <strong><a href="https://www.drugs.com/tirzepatide.html">tirzepatide</a></strong> (Mounjaro), have been used for years in the treatment of diabetes and off label in the body building community. These medications work by enhancing insulin secretion, slowing gastric emptying, and increasing satiety, leading to reduced food intake. The mechanism of action is well understood. However, the downsides are also known. One of the primary concerns is the loss of muscle mass along with fat, which can be a serious issue for sedentary individuals. So yes&#8212;if you&#8217;re considering using these medications, lift weights and eat plenty of protein.</p><p>There are some lesser-known issues, such as reduced heart rate variability (HRV), so I wouldn&#8217;t suggest casually using these drugs just to lose 10 pounds. But honestly, I see a lot of people who could have benefited from starting on them yesterday.</p><p>As for nutraceuticals, there are plenty of marketing claims about manipulating GLP-1 levels with probiotics, herbs, and other supplements. However, these claims are not supported by solid science. Mechanistically, to achieve the same effect as pharmaceutical GLP-1 agonists, you would need a 1,000-fold increase in the efficacy of these substances. So don&#8217;t bother.</p><p>The second approach is purely behavioral and has worked for me and many people I know&#8212;aligning your meal schedule with your circadian rhythm. This means eating three meals (breakfast, lunch, and dinner) at as consistent a time as possible each day, while absolutely avoiding any snacking between meals. Doing this helps prevent the constant need to eat by keeping your body in a more natural rhythm.</p><p>Regarding hunger between meals, there&#8217;s a remedy that can help (though, if you avoid snacking long enough, you will not need it). This involves using <strong>bitters</strong>, an ancient remedy believed to stimulate digestion and reduce appetite. Bitters work by activating bitter taste receptors in the mouth and gastrointestinal tract, which in turn stimulates the release of digestive enzymes and bile, potentially promoting a feeling of fullness or reducing cravings.</p><p>One specific supplement is <strong>Amarasate Extract</strong>, derived from New Zealand&#8217;s <strong>Gentian root</strong>. It contains bitter compounds that target receptors in the gut, signaling the brain to trigger satiety, which can help suppress appetite between meals. While these supplements aren&#8217;t highly effective for everyone, they can be a useful temporary hack until you get into the no-snacking habit.</p><p>Now, assuming you&#8217;ve got the basics under control but find yourself at a function with an overwhelming amount of tasty food (think authentic pasta dishes), self-control might not be enough&#8212;except for leaving early (which is often a good idea for other reasons).</p><p>First, <strong>front-loading with protein-rich foods</strong> when available (instead of carbs or alcohol) can help. This will increase satiety upfront, making you less likely to overeat.</p><p>Secondly, there are two pharmaceutical options you can consider:</p><ol><li><p><strong><a href="https://www.drugs.com/pro/orlistat-capsules.html">Orlistat</a></strong> blocks <strong>pancreatic lipase</strong>, the enzyme necessary to digest fats. The over-the-counter dose is 60 mg, which is mildly effective, while the prescription dose of 120 mg is more effective, reducing fat calorie absorption by about 30%. However, a warning: overuse will reliably lead to gastrointestinal side effects, especially diarrhea.</p></li><li><p><a href="https://www.drugs.com/mtm/acarbose.html">Acarbose</a> is a reversible alpha-glucosidase inhibitor that slows the breakdown of complex carbohydrates (such as pasta, rice, and bread) into simple sugars, delaying glucose absorption. Acarbose can lower postprandial blood glucose by about 20-40 mg/dL (closer to 20 mg/dL for me) per 25 mg dose, up to around 75 mg. Most of the calories from these carbs will eventually be absorbed, but some may pass into the large intestine, where they are fermented by gut bacteria. Beyond metabolic benefits, avoiding blood sugar spikes helps control hunger. Fun fact: Acarbose is only one of 3 drugs found to prolong lifespan in <a href="https://www.nia.nih.gov/research/dab/interventions-testing-program-itp">ITP</a> (Interventions Testing Program) studies (the other two being <a href="https://www.drugs.com/mtm/sirolimus.html">rapamycin</a> and <a href="https://www.drugs.com/drug-class/sglt-2-inhibitors.html">SGLT2 inhibitors</a>).</p></li></ol><p>Both of these medications must be taken at the right time&#8212;specifically with meals. The official recommendation is to take them with the first bite of food, but I&#8217;ve personally found they work a little better if you take them a few bites into the meal.</p><p>When it comes to simple sugars, that&#8217;s the toughest challenge. Unfortunately, there&#8217;s no highly effective remedy for handling them (aside from SGLT2 inhibitors, which wouldn&#8217;t qualify as a party hack). One supplement with some evidence, though not strong, is <strong>Gymnema Sylvestre extract</strong>. This herb contains compounds known as gymnemic acids, which are thought to work by blocking the sugar receptors on your taste buds, reducing the sweetness sensation and potentially curbing sugar cravings. It may also slow the absorption of sugar in the intestines by interacting with glucose transporters. However, while Gymnema Sylvestre might help a little if taken about 30 minutes before a meal, especially with sugary foods, it&#8217;s far from a magic bullet for controlling simple sugars.</p><p>In conclusion, these strategies can help you shave off some extra calories and reduce hunger, allowing you to get through meals with minimal damage.</p><p><em>Hope this helps and love to hear your feedback,</em></p><p><em>Ruslan</em></p><p><strong>Note</strong>: when I say something does or does not work for me, I rely on objective measurements, such as CGM in real-time, as well as calorie tracking (using <a href="https://cronometer.com/">Cronometer</a> app), calorie expenditure (using Apple Watch),  body composition (a scale as well as periodic DEXA scans).</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Differentiation in SaaS AI]]></title><description><![CDATA[A lot has happened in the last few months as the field has advanced (new models, new choices, lower prices), several companies have "inflected," and some things (CUDA dominance) have stayed the same.]]></description><link>https://www.equationblog.com/p/differentiation-in-saas-ai</link><guid isPermaLink="false">https://www.equationblog.com/p/differentiation-in-saas-ai</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Thu, 12 Sep 2024 17:30:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A lot has happened in the last few months as the field has advanced (new models, new choices, lower prices), several companies have "inflected," and some things (CUDA dominance) have stayed the same.</p><p>What has remained elusive, however, is where differentiation will ultimately lie, specifically among startups.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>In the interest of minimizing noise, I will omit opining on the most common themes that other people have been discussing at length (inference vs. training hardware, open-source vs. closed-source models, large vs. small models, various picks and shovels, and fine-tuning&#8212;though the latter remains an unsolved problem for application developers). All these areas have one thing in common: they require a lot of money and are thus mostly suitable for larger players.</p><p>The question is&#8212;where is, or is there, a differentiated opportunity? (And by differentiated, I mean outside of dumping a ton of money into a seed-stage company against a deck and a team&#8212;which can be differentiating on its own&#8212;or likely not.)</p><p>Let&#8217;s look at the basic structure of an LLM-first SaaS app. What does it need?</p><ol><li><p>Access to a set of competent LLMs of different sizes at a reasonable cost and speed (many choices, nothing super differentiated).</p></li><li><p>A RAG system (i.e., pgvector)&#8212;not differentiated, especially with larger context windows, where the emphasis on retrieval accuracy is somewhat diminished.</p></li><li><p>An orchestration infrastructure: there&#8217;s basically a classic flow, such as intent detection, query rewriting, summarization, output checking, and correction. These frameworks will evolve, and there will be many options. In fact, with more and more LLM usage to write code itself, one could argue that frameworks like LangChain add more layers of complexity and problems than they solve&#8212;though I could be proven wrong. There could be new ways to think about frameworks in a world where most of the code would be generated.</p></li><li><p>A testing  and observability infrastructure&#8212;definitely a major pain, but the problem area is obvious and many people are working on it. The question is whether it will spur s slew of new SaaS companies, or - considering immaturity of frameworks it will be more tightly coupled with service providers offering models and data services?</p></li><li><p>Coding tools (Copilot, Cursor, etc.)&#8212;again, quite obvious, and various tools will exist wrapping around ever-better models. The IDE becomes more of a very complex prompting interface and a feedback collection tool. There are several possibilities here:</p><ul><li><p>A revolutionary new interface that we have not yet seen or thought of</p></li><li><p>A plethora of Cursor-like forks and VSCode plug-ins, though it&#8217;s difficult to see how they win, unless Microsoft simply yields. Cursor being superior to GitHub Copilot is mostly a function of Cursor RAG-ing your codebase, and GitHub being presumably more &#8220;privacy&#8221; conscious. A simple business decision by <a href="https://www.linkedin.com/in/jkevinscott/">Kevin Scott</a> can alter that equation overnight.</p></li></ul></li><li><p>Differential fine-tuning and associated dataset management&#8212;no great solutions, but probably best addressed by model providers and companies like <a href="https://scale.ai">Scale.AI</a>.</p><p></p></li></ol><p>Alright. Some winners could emerge (especially around testing, and observability). It will likely converge into a few larger winners, most closely tied to model and data providers.</p><p>What are we overlooking? Can building an LLM wrapper be intrinsically differentiated? By intrinsically, I mean: is there an accumulation of leverage that creates a competitive moat over time? There&#8217;s always an opportunity to build a better product&#8212;better UX, clever influencer marketing, superior GTM&#8212;that leads to the acquisition of market share, and that can be differentiated, albeit hardly a sure thing.</p><p>There is likely one area that can be a source of leverage. We know proprietary data is that source in AI. But that&#8217;s too unstructured; to be differentiated, it needs to lead to a vastly superior outcome that is not obtainable by just using commonly available models.</p><p>So, what form is that, and how might it work?</p><p>When we look at a classic SaaS workflow (for example, responding to a customer service query), what happens is quite specific to each customer in two ways:</p><ol><li><p>Prompts are likely different for each domain and further different for each customer.</p></li><li><p>Prompts involving function calling (i.e., opening a Jira ticket) are definitely different, especially if a custom integration is involved.</p></li><li><p>Workflow routing logic may be different (though this is technically outside LLM data).</p></li><li><p>Error checking and validation at the end of the action can likewise be very specific to a customer domain and environment.</p></li></ol><p>Moreover, in other cases where user feedback is part of the loop, prompts may be dynamically updated (via LLM).</p><p>So, these prompts, in fact, constitute a customer-specific and user-specific dataset that grows over time, requires deep knowledge of customer workflows (and thus is not easily replicable), and grows over time.</p><p>Is this enough differentiation? Unclear&#8212;I think it depends on whether business workflows can converge and be serviced by a more intelligent model. However, to do so will require understanding these workflows first. While I think ultimately it will converge, there&#8217;s probably room on the way there.</p><p>Love to hear your thoughts,  </p><p>Ruslan</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[The supplements]]></title><description><![CDATA[Update (Mar, 2026)]]></description><link>https://www.equationblog.com/p/the-supplements</link><guid isPermaLink="false">https://www.equationblog.com/p/the-supplements</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sun, 26 May 2024 14:15:57 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div><hr></div><p><strong>Update (Mar, 2026)</strong></p><p><strong>Added:</strong></p><ul><li><p><strong><a href="https://www.amazon.com/dp/B01A68H2BA?ref=ppx_yo2ov_dt_b_fed_asin_title&amp;th=1">TUDCA</a>: 1 service, 250mg AM/PM</strong></p><ul><li><p>TUDCA is here primarily as a liver and bile-flow support compound. Unlike many &#8220;liver support&#8221; botanicals, it is a bile acid derivative, so the rationale is more targeted: it helps make the bile acid pool more hydrophilic, may reduce stress on hepatocytes, and has a cleaner mechanistic case in the setting of a crowded stack plus mild liver-enzyme drift. I&#8217;m treating it as pragmatic insurance rather than a performance supplement &#8212; the goal is preserving liver resilience and bile handling, not expecting any noticeable day-to-day effect.</p></li></ul></li></ul><p><strong>Removed:</strong></p><ul><li><p><strong><a href="https://www.amazon.com/gp/product/B00T5A1SD4/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Acetyl L-Carnitine HCI</a></strong>: 1 serving, 1g</p><ul><li><p>It does help transport fatty acids into mitochondria, acetyl-CoA group donor providing brain health benefits and good nootropic effects, and I did feel moderate benefits for the workouts. The downside - it feeds bacteria in the gut that produce TMAO. While TMAO impact is not completely settled, I am removing ALCAR on risk/reward basis, especially considering absorption problems with oral forms in general.</p></li></ul></li><li><p><strong><a href="https://www.amazon.com/gp/product/B07WVSYH5P/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Cordyceps</a></strong>: 1 serving, 2g</p><ul><li><p>Preliminary research suggested that Cordyceps may have positive effects on telomere length. Subjectively I did not feel any effect and upon further examination telomere length rationale is quite weak. Removing to simplify the stack.</p></li></ul></li></ul><p></p><p><strong>Removed</strong>:</p><ul><li><p><strong><a href="https://www.amazon.com/dp/B001DNV5CA?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">EGCG</a></strong> 1/2 serving, 1 cap, 200mg 2x/day</p><ul><li><p>Originally the thesis here was colorectal risk mitigation (there is mechanistic plausibility + some human data in the adenoma space), but on the latest labs I saw a mild upward drift in liver enzymes. Green tea extracts are one of the more common &#8220;usually fine, but occasionally not&#8221; botanicals from a liver standpoint &#8212; and when you&#8217;re monitoring LFTs, anything with even a small hepatotoxicity signal is a poor fit. Net: risk/reward no longer pencils out for me right now, so it&#8217;s out until LFTs normalize and a future re-challenge (if any) can be done cleanly.</p></li></ul></li><li><p><strong><a href="https://www.amazon.com/dp/B00LPJJHRC?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">AC-11</a></strong>  1cap 350mg into the morning shake</p><ul><li><p>This was a pure experiment around the &#8220;DNA repair / resilience&#8221; claims. After a reasonable run, the subjective benefit was barely detectable and I couldn&#8217;t find anything objective to justify keeping it in. Given how crowded the stack already is, anything that isn&#8217;t clearly pulling its weight gets cut.</p></li></ul></li><li><p><strong><a href="https://www.amazon.com/dp/B07H8NK8Y1?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">Gotu Kola extract</a></strong>: 1/2 serving, 1 cap, 120mg</p><ul><li><p>I like Gotu Kola for microcirculation / skin-collagen support, but similarly to EGCG it&#8217;s a botanical with rare (but real) hepatotoxicity case reports. With LFTs already trending the wrong way, this becomes an obvious &#8220;first things to pull&#8221; candidate. Removed for now.</p></li></ul></li></ul><p><strong>Added</strong>:</p><ul><li><p><strong><a href="https://www.amazon.com/dp/B0DNMWS7WX?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Urolithin A</a></strong>, 1/2 service, 1 cap, 1000mg, PM</p><ul><li><p>Urolithin A is here for mitochondrial &#8220;quality control&#8221; via mitophagy &#8212; i.e., preferential cleanup of dysfunctional mitochondria rather than just suppressing ROS. Another reason I like it: humans vary a lot in how well they can generate urolithins from food (microbiome-dependent), so supplementing can bypass that variability. The human RCT data isn&#8217;t perfect, but it&#8217;s stronger than most &#8220;mitochondrial&#8221; supplements, with safety/tolerability plus measurable signals on muscle endurance / mitochondrial health markers.</p></li></ul></li><li><p><strong><a href="https://www.amazon.com/gp/product/B07BH4JG35/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">PQQ</a></strong>: 1 cap, 20mg (AM)</p><ul><li><p>PQQ is a redox-active compound with a plausible role in mitochondrial biogenesis / cellular energy metabolism, and there are a few small human studies with signals on inflammation markers and cognitive endpoints &#8212; nothing definitive. I previously ran it and couldn&#8217;t find a clean personal signal, so conviction remains low. I&#8217;m re-running it only because it pairs conceptually with Urolithin A (different mechanism, same &#8220;mitochondrial cleanup + support&#8221; direction) and it&#8217;s easy to discontinue if it stays silent.</p></li></ul></li><li><p><strong><a href="https://www.amazon.com/dp/B0D3SNSJ47?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Colostrum</a></strong> 1 scoop, 1g, morning shake</p><ul><li><p>Colostrum is a &#8220;gut insurance&#8221; lever for me: bovine colostrum concentrates immunoglobulins (IgG), lactoferrin, and other bioactives that support barrier integrity and innate immune function. The best human data I&#8217;ve seen is in athletes / high training stress contexts, where colostrum has been associated with improved gut permeability markers and fewer URTI-type issues. I&#8217;m using a conservative dose (1g/day) and treating it as supportive &#8212; not magic. Avoid if dairy allergy is a concern.</p></li></ul></li></ul><p><strong>Updated</strong>:</p><ul><li><p><strong><a href="https://nootropicsdepot.com/synapsa-bacopa-monnieri-extract-whole-plant-powder/">Bacopa Monirelli</a></strong> (Synapsa): 1 serving, 320mg</p><ul><li><p>Formulation changed to <a href="https://www.amazon.com/dp/B0BQWY5BB9?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">Cognance</a> 1 serving, 100mg with rationale being a better overall profile with no potential sedation effects.</p></li></ul><p></p></li></ul><p><strong>Update (Oct, 2025)</strong></p><ul><li><p><strong>Replaced <a href="https://www.amazon.com/dp/B07QNMCXHG?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Reishi mushroom powder</a> </strong>with<strong> <a href="https://www.amazon.com/dp/B0F5F24B1K?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">Turkey Tail </a>mushroom powder</strong>: 1 serving, 1g (2 days on, 1 day off - non overlapping with Lion&#8217;s Mane).<strong> </strong>Reasons:</p><ul><li><p><strong>Safety/monitoring</strong>&#8212;Reishi has rare but documented hepatotoxicity case reports and inclusion in LiverTox, which matters if you&#8217;re watching LFTs; Turkey Tail has a long clinical track record as an adjuvant (PSK) with generally favorable tolerability.</p></li><li><p><strong>Evidence fit</strong>&#8212;Turkey Tail&#8217;s PSK/PSP have been studied extensively as immune modulators (including RCTs), with signals on NK-cell activity and broader immune function, making it a reasonable like-for-like immunologic substitute</p></li><li><p><strong>Potential 5-AR inhibition</strong>: There&#8217;s some weak evidence that Reishi may have an anti-androgenic potential, which we obviously don&#8217;t want</p></li></ul></li><li><p><strong>Added another</strong> <strong><a href="https://www.amazon.com/gp/product/B00E9M4XFI/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Creatine</a></strong>: 1 serving, 5g to the morning shake for 10g total (between pre-workout and the morning shake). New research strongly suggest cognitive benefits (especially when short on sleep) from this dosing and subjectively I feel a noticeable boost and no afternoon slump at all.</p></li></ul><p><strong>Update (Aug, 2025)</strong></p><p><strong>Added</strong>:</p><ul><li><p>Tongat Ali was replaced <a href="https://nootropicsdepot.com/eurycomax-optimized-tongkat-ali/?searchid=14818292&amp;search_query=eury">Eurycomax</a> and correspondingly 2.5mg of Pregnenolone was removed. In total between Cistamax, Eurycomax and direct supplementation DHEA and Pregnenolone dosages stayed at 5mg each.  So far appears to be a much smoother stack.</p></li><li><p><strong><a href="https://nootropicsdepot.com/maca-extract-powder/?searchid=14818332&amp;search_query=maca+root">Maca</a></strong> extract powder was added to the morning shake at 200mg. See notes.</p></li><li><p><strong><a href="https://nootropicsdepot.com/search.php?search_query=gotu%20kola">Gotu Kola</a></strong> was added into the morning shake along with the evening dose.</p></li><li><p><strong><a href="https://www.amazon.com/dp/B001DNV5CA?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">EGCG</a></strong> was added into the morning shake along with the evening dose. See notes.</p></li><li><p><strong><a href="https://www.amazon.com/dp/B0014UECK4?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Stinging Nettle Root Root</a></strong> extract was added into the morning shake along with the evening dose.</p></li><li><p><strong>NMN/NR</strong> stack was restructured and replaced with <a href="https://www.amazon.com/dp/B07TK4VYWS?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">NR</a> (Tru Niagen) plus Pterostilbene+Quercetin. See detailed discussion.</p></li><li><p><strong><a href="https://www.amazon.com/dp/B00LPJJHRC?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">AC-11</a></strong> was added experimentally, 1cap 350mg into the morning shake. So far seem to generate a slight mental kick. Data-wise not yet convinced.</p></li><li><p><strong>Creatine</strong> was moved from morning shake to pre-workout. Seemed to make a significant difference, probably due to transient DHT elevation.</p></li><li><p><strong><a href="https://www.drugs.com/rosuvastatin.html">Rosuvastatin</a></strong> (Crestor) along with <strong><a href="https://www.amazon.com/dp/B0CKC3TXCC?psc=1&amp;ref=ppx_yo2ov_dt_b_product_details">Geranylgeraniol</a></strong> were moved from <strong>AM</strong> to <strong>PM</strong> to avoid (theoretical) transporter conflicts with other supplements.</p></li><li><p><strong><a href="https://www.amazon.com/dp/B07ZPH99CQ?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">H2 Molecular Hydrogen</a> </strong>was added along with the morning supplements. See notes.</p></li></ul><p><strong>Removed</strong>:</p><ul><li><p><strong><a href="https://www.amazon.com/gp/product/B0036THML2/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1https://www.amazon.com/gp/product/B0036THML2/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;th=1">Gaia Herbs Adrenal Health Daily</a> -</strong> I took this for years; it made my <a href="https://www.equationblog.com/p/my-personal-daily-routine">original list</a> because it seemed helpful during a heavy-stress period. That said, I&#8217;ve always been skeptical of &#8220;adrenal support&#8221; blends&#8212;sensible ingredients, but usually well below clinically studied doses. After multiple on/off cycles, I&#8217;m convinced it isn&#8217;t doing anything meaningful (and at worst may slightly lower morning cortisol from the small amount of ashwagandha).</p></li></ul><p><strong>Update (Sep, 2024</strong>):</p><p><strong>Added</strong>:</p><ul><li><p><strong>Spermidine </strong>(see AM morning shake section)</p></li><li><p><strong>Magnesium Glycinate</strong>, 200mg in the evening replacing Magnesium complex</p></li></ul><p><strong>Removed</strong>:</p><ol><li><p><strong><a href="https://www.amazon.com/dp/B08NCFV5RJ?psc=1&amp;ref=ppx_yo2ov_dt_b_product_details">C60</a></strong> (in olive oil): 1 tsp</p><p>The premise: an interesting compound due to it&#8217;s direct and unique anti-oxidant action that doesn&#8217;t blunt effects of exercise.</p><p>It was an experiment and thus far I have not found neither subjective nor objective evidence of it doing anything (perhaps it was a problem with a specific brand).</p></li><li><p><strong><a href="https://www.amazon.com/gp/product/B07BH4JG35/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">PQQ</a></strong>: 1 cap, 20mg</p><p>The premise: research around mitochondrial support effects is quite solid. Quantitatively or subjectively I couldn&#8217;t see or feel anything. </p></li><li><p><strong>Probiotics</strong></p><p>After experimenting with several over the years, I found them to be useful for a specific purpose, but not as part of a maintenance routine.</p><p></p></li></ol><div><hr></div><p>Everyone is asking what supplements I take. I am going to share full list upfront and then we are going have a discussion.</p><p>Several disclaimers:</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><ul><li><p>This list is highly personalized and is periodically revised based on  quantitative testing, research updates, and subjective evaluation. What you see here is a snapshot as of today.</p></li><li><p>None of this is a medical advice. Please, do not apply this list to yourself without proper medical guidance.</p></li><li><p>It's crucial to remember that, from a scientific perspective, supplements (and even diet) are not as significant as one might think compared to exercise. Nutrition and supplement studies are typically of low quality and should be viewed with skepticism. They are most often based on epidemiology, which is unreliable due to confounding factors, biases, and difficulty in establishing causality. Although Mendelian randomization trials are more robust in inferring causality, they are limited in number due to costs and incentives. When they are done - these trials are often underpowered, too short, and tend to focus on populations that may not be relevant to our purposes.</p></li><li><p>Therefore, the totality of data, including observational data from various traditions, along with personal biomarkers, functional fitness metrics, and subjective evaluations, must all be considered.</p></li><li><p>Given the difficulty of evaluating efficacy, why do I pursue this? Partially, it's for the fun of biohacking.</p></li><li><p>The main goals for me are health span and performance (not longevity by itself - who wants to live longer without being able to do things anyway).</p></li></ul><p>Here&#8217;s how to read the list:</p><ul><li><p><strong>Supplement</strong>: (including the specific brand I use and dosages; note that I have no affiliation with any of the companies mentioned).</p></li><li><p><strong>Commentary</strong>: I&#8217;ll provide brief commentary without references (for the sake of my time). Accuracy is not guaranteed. Feel free to discuss it further with ChatGPT.</p></li><li><p><strong>Risk</strong>: low/medium/high (I don&#8217;t use anything I consider high risk). No supplement is zero risk as unknown effects may exist, and there could be discrepancies versus the label due to manufacturing or quality control issues.</p></li><li><p><strong>Personal outcome</strong>: none/minor/notable/significant - a subjective evaluation of the benefit based on an aggregate of biomarker, functional testing, and subjective feelings of well-being.</p></li><li><p><strong>Conviction</strong>: low/medium/high - my current level of conviction in the supplement's efficacy based on the aggregate of data, research and personal outcome.</p></li></ul><p>Let&#8217;s get right to it.</p><h1>AM - Pre-workout</h1><p>As you know from my <a href="https://open.substack.com/pub/ruslan/p/my-personal-daily-routine?r=jthe7&amp;utm_campaign=post&amp;utm_medium=web">earlier post</a> - I <a href="https://open.substack.com/pub/ruslan/p/my-current-workout-routine?r=jthe7&amp;utm_campaign=post&amp;utm_medium=web">workout</a> first thing in the morning and that drives some of the logic here (if I was to work out in the afternoon or the evening - the supplementation would be different)</p><p>The following supplements are mixed in as powders into a pre-workout drink:</p><p><strong><a href="https://www.amazon.com/gp/product/B09SP1Q3T9/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Essential Amino Acids</a></strong> (EAAs): 1 serving, 11g</p><p>As I work out in the morning and we know that in a hypo-caloric state the body can break down muscle for energy, having circulating amino acids (in addition to the stimulus from exercise) helps preserve muscle mass. Essential Amino Acids (EAAs) are beneficial because they are already in their simplest form and are readily available for muscle protein synthesis. This makes them easier for the body to use immediately compared to whole proteins, which need to be broken down first. Additionally, EAAs are easier on the digestive system.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: significant</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.amazon.com/gp/product/B00QYZ6MLG/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Legion Pulse</a></strong> Pre-Workout drink: 1/2 serving, 11.75g</p><p>This product serves as both a nootropic supplement and a pre-workout aid. It&#8217;s a well-designed product with no filler or questionable ingredients. The main components are: a moderate dose of caffeine for increased energy and focus, L-Citrulline for nitric oxide enhancement and improved blood flow, Betaine/TMG which helps control homocysteine levels (this ingredient combines nicely with other supplements, such as NAD precursors) and has several performance benefits, L-Theanine for its calming and focus-enhancing effects, Alpha-GPC as an effective acetylcholine precursor for cognitive enhancement, and electrolytes (sodium and potassium) to support hydration and muscle function.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: significant</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.amazon.com/gp/product/B00VAOKFO6/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Taurine</a></strong>: 1 serving, 2g</p><p>The research on taurine is impressive on many levels, demonstrating benefits for cardiovascular health and muscle function. However, I am not sure it&#8217;s doing anything noticeable for me personally. Taurine supplementation might be more important for people on a vegetarian or vegan diet, as they may have lower taurine levels due to the absence of taurine-rich animal products in their diet.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: none</p><p><strong>Conviction</strong>: low</p><p><strong><a href="https://www.amazon.com/gp/product/B081PBMGYJ/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Collagen</a></strong>: 1 serving, 10.29g</p><p>Collagen helps complement the amino acid profile provided by EAAs, offering specific amino acids like glycine, proline, and hydroxyproline. Research suggests that taking collagen prior to a workout, assuming adequate exercise stimulus, can support connective tissue and joint health, with some evidence also indicating benefits for bone health. This particular product uses special formulation of collagen and has additional ingredients such as Hyaluronic Acid, ch-OSA, and Buffered Vitamin C, which are known for their synergistic benefits for joint health.</p><p><strong>A side note</strong>: excess collagen, due to conversion of hydroxyproline into oxalates. Between this dose and the morning shake it comes down to 20g / day for me,  which may be close to an upper safe dose.</p><p><strong>Risk</strong>: moderate</p><p><strong>Personal outcome</strong>: notable</p><p><strong>Conviction</strong>: high</p><p><a href="https://www.amazon.com/gp/product/B0791NCWPL/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">N-Acetyl L-Tyrosine</a> (occasionally): 1 serving, 0.4g</p><p>N-Acetyl L-Tyrosine (NALT) is a precursor to neurotransmitters such as dopamine, norepinephrine, and epinephrine, as well as thyroid hormones like thyroxine (T4) and triiodothyronine (T3),, which can enhance mental focus and cognitive performance. I take it occasionally for an extra kick during workouts, as it helps improve alertness and mental clarity, contributing to a more energized workout experience.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: notable</p><p><strong>Conviction</strong>: moderate</p><p>The following supplements are taken separately as capsules along with the pre-workout drink at the same time:</p><p><strong><a href="https://nootropicsdepot.com/cistamax-capsules/">CistaMax</a></strong> (5 days on, 2 days off, 2 weeks washout every 10 weeks): 1 serving, 1 cap</p><p>This is an amazing combo and you have to admire the thinking that went into designing it. It would take a couple of pages to dissect it - so for the sake of time do your own research here.</p><p><strong>Risk</strong>: moderate</p><p><strong>Personal outcome</strong>: significant</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://nootropicsdepot.com/eurycomax-optimized-tongkat-ali/?searchid=14818292&amp;search_query=eury">Eurycomax</a></strong> (5 days on, 2 days off, 2 weeks washout every 10 weeks): 1 serving, 2 caps</p><p>Eurycomax is the new great combo based around Tongat Ali with additional ingredients, and it seems to avoid some of the concerns with pure Tongat by better balancing estrogen suppression. A reminder - anytime you are tinkering with hormones even a little, quantitative testing is a must to ensure safety and efficacy.</p><p><strong>Risk</strong>: moderate</p><p><strong>Personal outcome</strong>: significant</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://nootropicsdepot.com/dhea-quick-dissolve-tablets-micronized/?searchid=14818447&amp;search_query=dhea">DHEA</a></strong>: 1/2 serving, 1/2 tab, 2.5mg (5 days on, 2 days off)</p><p>At very low doses - such in in my case for a total of  5mg of DHEA and 5mg or Pregnenolone between DHEA tablet, Eurycomax and Cystamax this helps nudge, especially as we age the process of steroidogenesis in the right direction, towards testosterone synthesis without triggering negative effects and feedback loops.</p><p><strong>Risk</strong>: moderate</p><p><strong>Personal outcome</strong>: significant</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.amazon.com/gp/product/B00E9M4XFI/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Creatine</a></strong>: 1 serving, 5g</p><p>The benefits of creatine are well known - both for muscle endurance and brain (and even systemically). </p><p>There&#8217;s a plausible risk of DHT elevation (and resulting hair loss). The one frequently quoted  study was not replicated and there&#8217;re other issues with it (however, anecdotally there are a lot of bold body builders). Either way keeping an eye on DHT levels is recommended if hair loss is a concern.</p><p>Secondarily, creatine supplementation will elevate blood creatinine levels that will show up on a standard metabolic panel test (such as CMP). This is not a concern - if the cause is Creatine supplementation. However tell your doctor, so a more accurate biomarker can be used to assess kidney health (such as Cystatin C).</p><p>Finally - note that creatinine as well as liver enzymes (ALT most notably) can also be elevated due to normal muscle breakdown as a result of a strength training session, so likewise tell your doctor or use a wash out period (72 hours - I am not doing it).</p><p><strong>Note</strong>: when you severely underslept for whatever reason, additional 5mg of Creatine later in the day (I&#8217;d just add that extra dose into the morning shake) could be quite helpful as a temporary fix.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: significant</p><p><strong>Conviction</strong>: high</p><h1>AM - Breakfast (Post workout)</h1><p>These are taken in conjunction with my morning shake and whenever possible I use powders (instead of capsules, to avoid filler ingredients and flow agents) and capsules (whenever a powder form is not available) are broken up and put into the shake. Using powders is also more cost effective.</p><h2>Core Morning Shake recipe</h2><p><strong><a href="https://www.amazon.com/gp/product/B06XWXTX49/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Whey Protein</a></strong>: 25g</p><p><strong><a href="https://www.amazon.com/gp/product/B08L6Y12Q4/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Collagen Protein</a></strong>: 9g</p><p><strong><a href="https://www.amazon.com/dp/B0F22L62DC?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Fiber Powder</a></strong>  (most days, but sometimes I skip adding it when I want to give my gut a break from extra fiber)</p><p><strong><a href="https://www.amazon.com/gp/product/B08KYLFGH7/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Wild blueberry</a></strong> or <strong><a href="https://www.amazon.com/gp/product/B07QTCDMBW/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">elderberry powder</a></strong>: 5g (alternating) </p><p><strong><a href="https://www.amazon.com/gp/product/B001TNW23U/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Pomegranate powder</a></strong>: 1 tbs</p><p><strong><a href="https://www.amazon.com/gp/product/B07G3KLNM7/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Tart cherry powder</a></strong>: 4.8g (among many other benefits this helps to keep uric acid at low levels).</p><p><strong>Olive oil</strong>, 2 tsps (to make sure there&#8217;s a little bit of fat to help with absorption of fat soluble vitamins).</p><p>Some sort of fruit from whatever is around, could fresh or be frozen: banana, a cup of strawberries, raspberries, an apple, 2 peaches, 3 apricots, etc.</p><h2>Supplements that are mixed into the shake</h2><p><strong><a href="https://www.amazon.com/gp/product/B01BGZVXC6/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">O.N.E. multivitamin</a></strong>: 1 serving, 1 cap</p><p>Think of it as an insurance policy, although given my genetics - methylated B vitamins are great. There&#8217;s no perfect multi-vitamin and it&#8217;s a compromise for convenience.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: minor</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.amazon.com/gp/product/B00BN4OMW4/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Magnesium</a></strong> complex: 1/2 serving, 1 cap, 120mg</p><p>The benefits are well documented, especially considering that it is nearly impossible to get enough magnesium through the regular modern diet (with all the soil depletion).</p><p><strong>Risk: low</strong></p><p><strong>Personal outcome: </strong>minor</p><p><strong>Conviction: </strong>high</p><p><strong><a href="https://www.amazon.com/gp/product/B00NG1N8OU/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Glucosamin HCI</a></strong>: 1 serving, 1g</p><p>Decent research for joint (formation and repair of cartilage, synovial fluid production) and even brain health.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: minor</p><p><strong>Conviction</strong>: moderate</p><p><strong><a href="https://www.amazon.com/dp/B0D3SNSJ47?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Colostrum</a></strong> 1 scoop, 1g, morning shake</p><p>Colostrum is a &#8220;gut insurance&#8221; lever for me: bovine colostrum concentrates immunoglobulins (IgG), lactoferrin, and other bioactives that may support barrier integrity and innate immune function. The best human data I&#8217;ve seen is in athletes / high training stress contexts, where colostrum has been associated with improved gut permeability markers and fewer URTI-type issues. I&#8217;m using a conservative dose (1g/day) and treating it as supportive &#8212; not magic. </p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: minor</p><p><strong>Conviction</strong>: moderate</p><p><strong><a href="https://www.amazon.com/dp/B0014UECK4?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Stinging Nettle Root Extract</a></strong>, 1/2 serving, 1 cap, 250mg</p><p>Mechanism of action is anti-androgen activity specific to prostate, thus not affecting androgens systemically, which is quite nice. The outcomes are supported by solid data, including human trials.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: significant</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.amazon.com/gp/product/B07BH4JG35/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">PQQ</a></strong>: 1 cap, 20mg</p><p>PQQ is a redox-active compound with a plausible role in mitochondrial biogenesis / cellular energy metabolism, and there are a few small human studies with signals on inflammation markers and cognitive endpoints &#8212; nothing definitive. I previously ran it and couldn&#8217;t find a clean personal signal, so conviction remains low. I&#8217;m re-running it only because it pairs conceptually with Urolithin A (different mechanism, same &#8220;mitochondrial cleanup + support&#8221; direction) and it&#8217;s easy to discontinue if it stays silent.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: none</p><p><strong>Conviction</strong>: low</p><p><strong><a href="https://www.amazon.com/dp/B0BQWY5BB9?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">Bacopa Monirelli</a></strong><a href="https://www.amazon.com/dp/B0BQWY5BB9?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1"> </a>(Cognance): 1 serving, 100mg</p><p>Bacopa is a long-running cognitive &#8220;insurance&#8221; herb with human trial data suggesting benefits for memory/processing after weeks of consistent use (with GI side effects being the most common downside). The trade-off with traditional bacopa extracts is that some people feel slightly sedated or &#8220;flat.&#8221; Cognance is a newer bacopa extract with a different standardization profile, and the bet here is keeping the brain-support upside while avoiding the sedation tax.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: minor</p><p><strong>Conviction</strong>: moderate</p><p><strong><a href="https://nootropicsdepot.com/maca-extract-powder/?searchid=14900538&amp;search_query=maca+extract">Maca Extract Powder</a></strong> (5% macamides): 1 serving, 125mg</p><p>An interesting compound that inhibits <strong>FAAH</strong> (the enzyme that breaks down anandamide), which can raise endocannabinoid tone. The result is a noticeable mood and well being elevation. The safety record is excellent and it appears not to affect hormones or major neurotransmitters.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome:</strong> notable</p><p><strong>Conviction</strong>: moderate</p><p><strong><a href="https://www.amazon.com/dp/B07W8TL9DZ?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">Lion&#8217;s Mane</a></strong>: 1 serving, 1g,  1 day on, 2 days off</p><p>For brain benefits as it is known to increase both BDNF (Brain-Derived Neurotrophic Factor)) and especially NGF (Nerve Growth Factor).</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: minor</p><p><strong>Conviction</strong>: moderate</p><p><strong><a href="https://www.amazon.com/dp/B0F5F24B1K?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">Turkey Tail</a> mushroom powder</strong>: 1 serving, 1g (2 days on, 1 day off - non overlapping with Lion&#8217;s Mane)</p><p>The idea is to boost NK-cell activity as a preventative. Turkey Tail &#8212;notably its PSK/PSP fractions and mycelium&#8212;has human data showing increased NK-cell tumoricidal activity and higher circulating NK cells vs. control: a phase-I study in breast-cancer survivors reported improved NK function and lymphocyte counts; a randomized double-blind trial linked rises in the NK-activation marker CD69 to functional gains; and oncology RCTs using PSK showed peripheral NK-cell increases compared with control.</p><p><strong>Note</strong>: if you have any kind of autoimmunity, I would avoid.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: minor</p><p><strong>Conviction</strong>: moderate</p><p><strong><a href="https://www.amazon.com/gp/product/B08QC4V25L/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Inositol</a></strong>: 1 serving, 1g</p><p>Supports healthy thyroid function and helps keeps TSH levels in the optimal range.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>:: notable</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://nootropicsdepot.com/milk-thistle-4-1-extract-silybum-marianum/">Milk Thisle</a></strong>, 1 serving, 100mg</p><p>Milk thistle is well-researched for its liver support properties, primarily due to its active compound, silymarin. Silymarin has antioxidant, anti-inflammatory, and antifibrotic effects, which help protect liver cells from damage, support liver regeneration, and improve liver function. Probably a good insurance with all the other supplements I am taking &#128578;. </p><p>While research supports these benefits, it hasn&#8217;t been enough time to ascertain the results for me personally.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: none</p><p><strong>Conviction</strong>: low</p><p><strong><a href="https://www.amazon.com/dp/B09WDJZYSQ?ref=ppx_yo2ov_dt_b_fed_asin_title">Spermidine</a></strong>, 1 service, 2 caps, 13mg equivalent</p><p>Spermidine induces autophagy by inhibiting the activity of key enzymes involved in acetylation, such as histone acetyltransferases. This inhibition leads to the deacetylation of autophagy-related genes, which promotes the formation of autophagosomes&#8212;vesicles that engulf and degrade damaged proteins, organelles, and cellular debris.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: none</p><p><strong>Conviction</strong>: moderate</p><p><strong><a href="https://www.amazon.com/dp/B07TK4VYWS?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">TruNiagen NR</a></strong>: 1 serving, 1 cap, 300mg, combined with <strong><a href="https://www.amazon.com/dp/B07QB7DP4G?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_6">Quercetin + Pterostilbene</a></strong>, 1 serving, 2 caps, 500mg Quercetin, 50m of Pterosilbene)</p><p>This deserves a bit of discussion. The whole idea of using NAD+ precursors came out of research by David Sinclair. We know NAD+ decreases as we age and it is important with 1000s different things, most interestingly with Sirtuin activation (Resveratrol or similar compounds, such as Pterostilbene).</p><p>However there&#8217;re many issues with it:</p><ol><li><p>No positive effects have been observed in humans thus far (except for one study on cognitive health with NR)</p></li><li><p>Both NR and Resveratrol failed <a href="https://www.nia.nih.gov/research/dab/interventions-testing-program-itp">ITP</a> studies (which thus far is a gold standard for longevity research).</p></li><li><p>The original Resveratrol study has been thoroughly discredited due to study design (in fact it turned out to be so bad, people wonder how it (and it&#8217;s authors) got so much attention in the first place),</p></li><li><p>There is a rodent study showing cancer growth acceleration effects from NR (for pre-existing cancers - mechanistically this makes sense as NAD+ is quickly depleted by hungry cancer cells, and replenishing NAD+ will only accelerate that process)</p></li><li><p>There is a concern of up-regulating CD38, although in moderate doses this should not be a concern (there&#8217;s yet to be determined U-shape curve here that may depend on many variables). This could potentially be mitigated by Quercetin.</p></li><li><p>Bio-availability of Resveratrol is low (I am using Pterostilbene instead for that reason, although it&#8217;s not clear that it is doing anything either).</p></li><li><p>In addition any of these precursors need to be supplemented with TMG (in my case there&#8217;s plenty in Legion pre-workout drink)  to prevent elevation of homocysteine due to back conversion to nicotinamide.</p></li></ol><p>That said - mechanistic theory around how all this could work for longevity is quite compelling and it is possible that it would take considerable time for it to work.</p><p>Subjective effects for me in terms of feeling more energetic, however the question is whether  supplementing with plain old nicotinamide would accomplish the same outcome (including NAD+ support) for a fraction of the cost.</p><p>Finally - there&#8217;s still a serious, unresolved conundrum here. NAD+ enhances cellular energy by participating in mitochondrial redox reactions, boosting ATP synthesis (hence feeling energetic). It is also essential for sirtuin activation, which signals autophagy and repair&#8212;processes that act as evolutionary sensors for low nutrient levels.</p><p>However, consuming a lot of protein, necessary for maintaining muscle mass, stimulates the mTOR pathway, contradicting these aims. The mTOR pathway promotes cell growth and inhibits autophagy, counteracting the benefits. An alternative approach involving cycling and agents like <a href="https://www.drugs.com/mtm/sirolimus.html">Rapamycin</a> might be better, but the exact protocols and pros and cons are unclear. Most importantly, I would have no idea how to reconcile it with my everyday workout routine.</p><p><strong>Risk</strong>: moderate</p><p><strong>Personal outcome</strong>:: notable</p><p><strong>Conviction</strong>: low</p><h2>Supplements taken along with the shake / breakfast</h2><p>These are supplements that are not (or cannot be) available in a powder form or breakable capsules.</p><p><strong><a href="https://www.amazon.com/gp/product/B07VF9HB74/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">EPA/DHA</a></strong>: 1/2 serving, 1 cap</p><p>Not much to opine on, the benefits of EPA/DHA are well researched. I keep my omega index &gt;8 (8.94% as of last test) on <a href="https://omegaquant.com/">OmegaQuant</a> test (your dose may vary, so test it).</p><p><strong>Risk: low</strong></p><p><strong>Personal outcome: </strong>minor</p><p><strong>Conviction: </strong>high</p><p><strong><a href="https://www.lifeextension.com/vitamins-supplements/item02334/super-k">Vitamin K1/MK-4/MK-9</a></strong>: 1 serving, 1 cap</p><p>A must considering supplementation with Vitamin D (O.N.E. multivitamin has 2000 IU), plus other vascular and bone health benefits.</p><p><strong>Risk: low</strong></p><p><strong>Personal outcome: </strong>none (as far as subjective feeling, calcium score continues to stay at 0 and bone density on DEXA went up - so that&#8217;s be pretty good)</p><p><strong>Conviction: </strong>high</p><p><strong><a href="https://www.amazon.com/gp/product/B073VL4WKN/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">COQ10</a></strong>: 1 serving, 1 cap, 100mg</p><p>The idea here is to counteract effects of a statin that whacks HMG-CoA enzyme and decreases COQ10 levels as a result. While studies are conflicted, the subjective feeling is noticeable for me.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>:: notable</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.amazon.com/dp/B01A68H2BA?ref=ppx_yo2ov_dt_b_fed_asin_title&amp;th=1">TUDCA</a>: 1 service, 250mg</strong></p><p>TUDCA is here primarily as a liver and bile-flow support compound. Unlike many &#8220;liver support&#8221; botanicals, it is a bile acid derivative, so the rationale is more targeted: it helps make the bile acid pool more hydrophilic, may reduce stress on hepatocytes, and has a cleaner mechanistic case in the setting of a crowded stack plus mild liver-enzyme drift. I&#8217;m treating it as pragmatic insurance rather than a performance supplement &#8212; the goal is preserving liver resilience and bile handling, not expecting any noticeable day-to-day effect.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>:: none</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://nootropicsdepot.com/smart-ps-phosphatide-complex-softgel-capsules-100mg-phosphatidylserine/">Smart PS&#8482; Phosphatidylserine</a></strong>: 1 serving, 1 cap</p><p>This is a blend of phosphatidylserine, phosphatidylcholine, and phosphatidylethanolamine and supports brain health by providing essential phospholipids that contribute to cognitive function, memory, and overall neural health by playing a key role in cell membrane integrity and the formation of synaptic vesicles.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>:: minor</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.amazon.com/gp/product/B000POSA2Q/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Aged Garlic Extract</a></strong>: 1/2 serving, 1 cap, 300mg</p><p>This is to mitigate the potential increase in TMAO levels caused by ALCAR and choline precursors, as elevated TMAO has been associated with cardiovascular risk. While the link between TMAO and ASCVD (Atherosclerotic Cardiovascular Disease) is not definitively proven (as otherwise people who eat a lot of fish would all die from heart attacks - which is obviously not the case), there is enough evidence to warrant some precaution. AGE has additional benefits, such as mild cholesterol-lowering effects, but those are not the primary reasons for taking it in this case. Note that garlic is a FODMAP and may cause digestive issues for some people, especially those with SIBO.</p><p><strong>Risk</strong>: low (see above)</p><p><strong>Personal outcome</strong>: none</p><p><strong>Conviction</strong>: low<br></p><p><strong><a href="https://www.amazon.com/dp/B07ZPH99CQ?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1&amp;th=1">H2 Molecular Hydrogen</a></strong>: 1 serving, 1 tablet (80mg magnesium, 8ppm), dissolved in water when taking other supplements.</p><p>There is a significant body of research suggesting that H2 is a selective antioxidant neutralizing mostly the harsh stuff (&#8226;OH, ONOO&#8315;) while sparing signaling ROS (e.g., H&#8322;O&#8322;) that drive exercise adaptations.</p><p>Risk: low</p><p>Personal outcome: none</p><p>Conviction: low</p><h2>Prescription medications</h2><p>Goes without saying that these always must be discussed with and prescribed by your doctor.</p><p><strong><a href="https://www.drugs.com/rosuvastatin.html">Rosuvastatin</a></strong> (Crestor): 5mg <strong>(PM)</strong></p><p>While there&#8217;s a lot of controversy around the use of statins - I am quite convinced based on available research. ASCVD progression is a stochastic process and keeping APO-B concentrations low is, in my opinion very much worth it.</p><p>That said - statins are not perfect drugs with a lot of off target effects. First, they work by suppressing HMG-CoA cascade, that causes all kinds of problems. Secondly it seems there&#8217;s causal evidence of increase in insulin resistance overtime. On top of that, there&#8217;re negative effects on liver (observed by modest elevation of liver enzymes and despite unclear clinical significance - this is still not cool).</p><p>That said, for me Rosuvastatin is still the best (of imperfect) choices. First - it is the only effective hydrophilic statin that (unlike lipophilic statins, such as <a href="https://www.drugs.com/atorvastatin.html">Atorvastatin</a> (Lipitor) doesn&#8217;t massively diffuse into tissues or crosses the blood-brain barrier (thus reducing potential side effects, such muscle soreness).</p><p><strong>Risk</strong>: moderate</p><p><strong>Personal outcome</strong>: notable (as measured by calcium score staying at zero)</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.drugs.com/tadalafil.html">Tadalafil</a></strong> (Cialis): 5mg <strong>(AM)</strong></p><p>While developed as an ED drug, I take it for it&#8217;s off target systemic effects on improving the blood flow. As we know - most ills stem from poor blood flow as we age. Improved circulation also enhances workout performance and supports overall cardiovascular health, contributing to smoother functioning of many bodily systems. Tadalafil also slightly lowers blood pressure, helping to maintain it within the optimal range.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>:: significant</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.drugs.com/cons/minoxidil-oral.html">Minoxidil</a></strong> (Loniten): 2.5 mg (<strong>PM</strong>)</p><p>I am using it as a preventative for hair loss at low dose (it is an old, repurposed blood pressure medication). It doesn&#8217;t seem to affect the blood pressure much for me, if anything it may be lowers it by a few points, which is a nice bonus. Otherwise no noticeable side effects.</p><h3>Coffee</h3><p>The discussion would not be complete without coffee. Coffee is one of the most widely used drugs in the world. 67% of Americans drink coffee (that&#8217;s why I suppose it just got removed from the CPI by the BLS as the price of coffee has been going up).</p><p>I take my coffee black (Americano) with no milk or sugar of course. I do add 1 tsp of <a href="https://www.amazon.com/gp/product/B07XTZRXCV/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">C8</a> MCT oil into my morning coffee.</p><p>On most days that be just one mug, but sometimes I also get the second cup around lunchtime. More than that starts adversely affecting my sleep quality - so if I must, I switch to decaf.</p><h1>PM (with dinner)</h1><p><strong><a href="https://www.amazon.com/gp/product/B07VF9HB74/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">EPA/DHA</a></strong>: 1/2 serving, 1 cap</p><p>The second dose as in AM.</p><p><a href="https://www.amazon.com/Magnesium-Glycinate-Capsules-Supplement-Supports/dp/B078SH5C4S/ref=sr_1_1_sspa?crid=39XNN6LQ7W7N3&amp;dib=eyJ2IjoiMSJ9.UoMfJKxej4_me1ExwJ82ID5RGK4DCBn6omrTt8b-GrY7E-hbviVg6W6eK5FQwWw24g_4PFd048HxT39tRHJ5C0MBu1wKgRRcCcoCqfpEbOPx1OyDFGSNyJsJmE4MKqpFoMIfvXr623pH3y6pqwsXXVDOcPSa5SsHWJruC3ISEGqbXTbeTyFiYRetLEdVknegxSOcXj4FZA_c2l1abmZR0hwQ1qbKluOQqJO0pXz9nceIzZ8KSqfU1-J6qk_9kQ7z3Fj6vq18g9uJSCk2ZnoQnLWtIde8NJSfHO_OYkOYiXI.pS_wK9zIxObxSpYq4A1pSsjrkrEluFX7mDAnDL8rOlM&amp;dib_tag=se&amp;keywords=magnesium%2Bglycinate&amp;qid=1730938542&amp;rdc=1&amp;sprefix=magnesium%2Bgl%2Caps%2C231&amp;sr=8-1-spons&amp;sp_csd=d2lkZ2V0TmFtZT1zcF9hdGY&amp;th=1">Magnesium</a> glycinate: 1 serving, 1 cap, 200mg</p><p>Seem to be more effective than than standard magnesium complex and has a notable calming effect for sleep.</p><p><strong>Risk: low</strong></p><p><strong>Personal outcome: </strong>significant</p><p><strong>Conviction: </strong>high</p><p><strong><a href="https://www.amazon.com/gp/product/B07BH65296/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Magnesium L-Threonate</a></strong>: 2/3 serving, 2 caps, 96mg</p><p>Brain health benefits and a minor calming effect for sleep.</p><p><strong>Risk: low</strong></p><p><strong>Personal outcome: </strong>minor</p><p><strong>Conviction: </strong>low</p><p><strong><a href="https://www.amazon.com/gp/product/B073VL4WKN/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">COQ10</a></strong>: 1 serving, 1 cap, 100mg</p><p>The second dose as in AM.</p><p><strong><a href="https://www.amazon.com/dp/B01A68H2BA?ref=ppx_yo2ov_dt_b_fed_asin_title&amp;th=1">TUDCA</a>: 1 service, 250mg</strong></p><p>The second dose as in AM.</p><p><strong><a href="https://www.amazon.com/gp/product/B01D15LMCK/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">BroccoMax</a></strong>: 1 serving, 2 caps</p><p>The research is decent, especially around colon cancer prevention effects. However I can&#8217;t quantitatively put a handle on it for me personally thus far.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>:: none</p><p><strong>Conviction</strong>: low</p><p><strong><a href="https://www.amazon.com/dp/B0DNMWS7WX?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Urolithin A</a>,</strong> 1/2 service, 1 cap, 1000mg, PM</p><p>Urolithin A is here for mitochondrial &#8220;quality control&#8221; via mitophagy &#8212; i.e., preferential cleanup of dysfunctional mitochondria rather than just suppressing ROS. Another reason I like it: humans vary a lot in how well they can generate urolithins from food (microbiome-dependent), so supplementing can bypass that variability. The human RCT data isn&#8217;t perfect, but it&#8217;s stronger than most &#8220;mitochondrial&#8221; supplements, with safety/tolerability plus measurable signals on muscle endurance / mitochondrial health markers.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: none</p><p><strong>Conviction</strong>: moderate</p><p><strong><a href="https://www.amazon.com/dp/B0014UECK4?ref_=ppx_hzsearch_conn_dt_b_fed_asin_title_1">Stinging Nettle Root Extract</a></strong>, 1/2 serving, 1 cap, 250mg</p><p>The second dose as in AM.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: significant</p><p><strong>Conviction</strong>: high</p><p><strong><a href="https://www.amazon.com/gp/product/B000POSA2Q/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Aged Garlic Extract</a></strong>: 1/2 serving, 1 cap, 300mg</p><p>The second dose as in AM.</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: none</p><p><strong>Conviction</strong>: low</p><p><strong><a href="https://www.amazon.com/dp/B0CKC3TXCC?psc=1&amp;ref=ppx_yo2ov_dt_b_product_details">Geranylgeraniol</a></strong>: 1 serving, 1 cap</p><p>A fairly new compound that also counteracts negative effects of statins, by a different mechanism. </p><p>Geranylgeraniol (GGOH) is a critical intermediate in the mevalonate pathway, downstream of HMG-CoA reductase and it is necessary for the prenylation of small GTP-binding proteins. Prenylation is a post-translational modification that allows these proteins to anchor to cell membranes and function properly.</p><p>By supplementing with GGOH, the normal function of these prenylated proteins can be restored. Subjectively for me it appears to be even more efficacious than COQ10 (although I would keep both as mechanisms and effects are different).</p><p><strong>Risk</strong>: low</p><p><strong>Personal outcome</strong>: notable</p><p><strong>Conviction</strong>: high</p><h1>Discussion</h1><p>So, that's a lot of supplements. The question is: can you achieve the same results with just a good diet? Probably not.</p><p>Even if you source high-quality ingredients and are prepared to spend an inordinate amount of time cooking everything perfectly, it's unlikely you'll achieve all the effects and benefits tailored to your genetics, personal health circumstances, and performance goals without some scientific enhancement.</p><p>The second question: can you avoid supplements altogether? Of course, you can. The human body is very resilient and can tolerate suboptimal conditions for a while. However, if performance optimization is your goal, then supplementation should be considered.</p><p>Another point: ingredients in food are presumably in more natural ratios than in a synthetically constructed supplement regimen. True. However, our body is highly adaptable and can handle a lot of variation. Besides, common processed foods like french fries and soda are far from natural or balanced.</p><p>Another concern is supplement quality, fillers, and labeling. This is valid. Fillers are problematic, which is why I opt for powders, choose the cleanest possible options, and stick with reputable brands. Risks, such as heavy metal contamination, do exist, but they are manageable if you monitor important biomarkers regularly. On the flip side - common food products are hardly immune from this problem, from Glyphosate, micro-plastics, heavy metals and other types of contaminants.</p><p>Does it take a lot of work? Some, but not a lot, as I've fine-tuned my routine over the years. I also batch my shakes, preparing and mixing them on the weekend for the week, so the actual breakfast prep takes very little time.</p><p>Hope this helps - in whatever way it does &#128578; </p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Cold Sales Outreach Emails Inefficacy - a sign of upcoming disruption?]]></title><description><![CDATA[Everyone does it, everyone hates it, large companies are built on it (Outreach, Apollo) and it underlines the foundation of CRM systems (like Salesforce and HubSpot). What prompted me to opine here was an accidental feature we developed at Jelled.AI that sorts communications that you don&#8217;t need to pay attention to out. Looking at the results of it - it got me thinking. I will explain.]]></description><link>https://www.equationblog.com/p/cold-sales-outreach-emails-inefficacy</link><guid isPermaLink="false">https://www.equationblog.com/p/cold-sales-outreach-emails-inefficacy</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sun, 12 May 2024 14:15:58 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Everyone does it, everyone hates it, large companies are built on it (Outreach, Apollo) and it underlines the foundation of CRM systems (like Salesforce and HubSpot).</p><p>What prompted me to opine here was an accidental feature we developed at <a href="https://jelled.ai">Jelled.AI</a> that sorts communications that you don&#8217;t need to pay attention to out. Looking at the results of it - it got me thinking. I will explain.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>But first - there is no surprise the efficacy of cold outreach emails has been declining (despite all the hacks - that largely do not work) and probably reached the point (with the introduction of AI tools into sales workflows) where the cost to send them is near zero, but the efficacy is an asymptote of zero. In fact I would argue there&#8217;s a high probability of a net negative return as receiving cold emails could drain whatever good will the customer could have had before opening them.</p><p>My personal routine handling inbound sales outreach was as follows:</p><ol><li><p>I never respond to cold sales outreach, except if it&#8217;s from founders</p></li><li><p>I will mark repeated sales outreach email as spam (although that doesn&#8217;t seem to help due to various email distribution tricks people are using) and will block the sender.</p></li></ol><p>Nonetheless - that&#8217;s still annoying.</p><p>So - with the new feature, <a href="https://jelled.ai">Jelled.ai</a> automatically detects inbound sales emails (so far with 100% accuracy) and sorts them out into a separate folder (the AI engine itself keeps the content if you ever need to do a research on it in the future).  As a result - my inbox is super clean and I no longer get annoyed by the all the inbound.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!stWi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!stWi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png 424w, https://substackcdn.com/image/fetch/$s_!stWi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png 848w, https://substackcdn.com/image/fetch/$s_!stWi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png 1272w, https://substackcdn.com/image/fetch/$s_!stWi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!stWi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png" width="1456" height="839" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:839,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:368798,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!stWi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png 424w, https://substackcdn.com/image/fetch/$s_!stWi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png 848w, https://substackcdn.com/image/fetch/$s_!stWi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png 1272w, https://substackcdn.com/image/fetch/$s_!stWi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc0b41aca-5771-4f9f-aadd-5e8eac7b1a7e_3146x1812.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p>However - this brings up several interesting observations. While using LLMs to generate emails is practically free (comparing to other methods involving humans) - getting rid of these emails is also free. Therefor the usefulness of the cold outbound channel is NIL and the only people making money here are from OpenAI.</p><p>The voice channel was already useless (I never ever pick up the phone in the first place if you are not in my address book). Text will largely meet similar fate (including mentions).</p><p>So, if cold outbound is of no value - that means that the value of SDRs is just an expense (the introduction research can be done by AI agents, while the introductions will still need to be done through the influencer network).</p><p>What does this do? Does It invert the SaaS sales model as it empowers the buyer to run a competitive bidding process through AI - which is already aware of company plans, budgets, timelines, etc.?</p><p>This, being on top of other AI advances, brings into question the need for CRM systems&#8212;which really have nothing to do with customer relationship management but are designed to manage the salesforce. Humans are forgetful, unorganized, need complex compensation plans, and are generally flawed. AI-first designed systems will obviously not have any of those problems. Additionally, as we have fewer humans, we will need fewer seats&#8212;there goes your pricing model as well.</p><p>Are we going to see an upheaval in the world of Salesforce? Does it apply to other professionals (in marketing for example)?</p><p>Something possibly to keep an eye on in the near future.</p><p>Thoughts welcome!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[A quick look at some popular AI predictions]]></title><description><![CDATA[Recently, there have been a number of predictions voiced in several respected channels to which we all pay attention.]]></description><link>https://www.equationblog.com/p/a-quick-look-at-some-popular-ai-predictions</link><guid isPermaLink="false">https://www.equationblog.com/p/a-quick-look-at-some-popular-ai-predictions</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Mon, 04 Mar 2024 03:26:27 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/08a77778-fa14-45b3-807e-cf609c4d2aee_1024x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Recently, there have been a number of predictions voiced in several respected channels to which we all pay attention.</p><p>A quick evening note here&#8212;without going too deep or referencing sources and benchmarks&#8212;on the 3 claims that I am personally skeptical about.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h4><strong>Open source models will catch up with closed source models</strong></h4><p>The argument goes something like this: "The web is finite, therefore all data is/will ultimately be available for training, and what is not available (proprietary data) will remain a small fraction and thus of little impact. From there, it follows that if all models are trained on the same data, the performance will inevitably converge, and therefore there is not much value add in proprietary models."</p><p>The thing is, even if we assume that most relevant data is freely available (various legal issues aside), it takes a lot of resources to train and update the base model and even more resources to instruct it to make it useful (considering all the infrastructure setup and humans involved). While off-the-shelf models like LlamaN&#8230; may work in some contexts, it is unlikely to form a foundation for a true competitive advantage.</p><p>What can alter this dynamic? Perhaps new architectures and approaches to training and fine-tuning, making the process a lot less resource and cost intensive.</p><h4><strong>LLM utility is limited to text generation</strong></h4><p>(or image generation)</p><p>From a first principles standpoint, this is true. We see that auto-regressive generation (LLMs) is "an exponentially divergent diffusion process, hence not controllable."</p><p>However, a combination of LLM with pure logic software may be able to yield good planning/reasoning outcomes, albeit not yet broadly generalizable.</p><p>For example, having an LLM generate code, attempt to run/fix that code, and later evaluate the output demonstrates this is feasible. In addition, this limitation is so well understood and so many people are working on solving it that inevitably&#8212;scalable solutions will come, or at least it&#8217;s not unreasonable to bet on it.</p><h4><strong>LLMs are too slow for real-time user workflows</strong></h4><p>This is true on the face of it, and a massive improvement is unlikely in the medium term.</p><p>However,</p><p>there are a lot of (if not the majority of) workflows, especially enterprise workflows, that are not real-time and do not need to be real-time - communication, document generation, analytics, and many others). So, not only can we take our sweet time, but we can also execute very complex, multi-step LLM workflows without issues (except for cost). In fact, the cost right now is a much bigger problem.</p><p>There are also a ton of performance tricks one can play (from caching, progressive enhancement, different models for different tasks) to make the system look a lot more performant than it really is.</p><p>Happy Sunday and to the productive Monday,</p><p>Ruslan</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[The uncertain AI landscape]]></title><description><![CDATA[It has been more than a year since the release of ChatGPT and the ensuing repositioning of the industry.]]></description><link>https://www.equationblog.com/p/the-uncertain-ai-landscape</link><guid isPermaLink="false">https://www.equationblog.com/p/the-uncertain-ai-landscape</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sun, 31 Dec 2023 15:15:08 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>It has been more than a year since the release of ChatGPT and the ensuing repositioning of the industry. I jotted down some quick <a href="https://open.substack.com/pub/ruslan/p/cambrian-explosion-in-ai-sunday-night?r=jthe7&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcome=true">notes</a> at the time, and they seem not to have aged all that poorly. Yet, it is time to check the pulse as the year ends.</p><p>These days, everyone is a Nostradamus when it comes to AI, and I am going to try to avoid direct predictions. The future is always uncertain, yet some clusters of opportunity are still visible.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>To refresh the basics, the main breakthroughs so far have come from:</p><ol><li><p>Compute (training/inference) and the ability to store and handle large amounts of data</p></li><li><p>Development of Word2Vec and subsequent advances in efficiently learning word embeddings</p></li><li><p>Transformer architecture that dispensed with the sequential processing of RNN-type architectures, enabling the practical training of large models</p></li></ol><p>The main unknown factor was when exactly the compute and the data would lead to breakthroughs. Just like with CNNs for vision, there turned out to be a threshold past which things just started to work.</p><p>Again, it's important to remember that all an LLM does is predict the next token. So, how is it exactly useful?</p><p>Well - isn't that what people do? That's why hallucination is not a bug, but the key feature. One could conflate the LLM temperature setting (a hyperparameter used to tweak the probability distribution for selecting the next word) with the number of drinks at a party.</p><p>It has also been fascinating to observe the evolution of open-source LLMs and various claims of models of GPT 3.x quality (interestingly, no one so far has reached GPT-4 quality even according to benchmarks - and one should be very skeptical of benchmarks at this stage). Why is that?</p><p>Well - let's look deeper at what influences LLM quality:</p><ol><li><p>The dataset to train the base model: if your base dataset contains clean, large, high-quality data - such as well-written books, legal libraries, scientific articles, as opposed to data scraped from the web - your next-word predictions will be more aligned with what Seneca would have said versus a modern-day guru on Twitter/X.</p></li><li><p>The amount and quality of effort put into instructing the model - that is, old-fashioned continuous human-assisted feedback.</p></li><li><p>The ability to train a large enough model, appropriately matched (in terms of the number of parameters) to the size of the dataset.</p></li></ol><p>The bottleneck is actually not (3), but (1) and (2). Training the model is mostly just money and fixed time (not to minimize the effort going into optimizing and managing training pipelines, but it is still reasonably straightforward), while (1) and (2) require a lot of custom infrastructure (both compute and human), sufficient user feedback (how many users are using the platform), a lot of (indeterminable) time and is quite error prone. The outcome is highly sensitive to quality data and feedback. Finally, it inevitably raises a number of legal and data ownership issues. Most importantly, these advantages (or lack thereof) will compound over time, making it largely a winner-take-all game.</p><p>So - how probable is it that open-source LLMs will catch up? Not very. That&#8217;s not to say that there&#8217;s no room for smaller task-oriented models (i.e., classification tasks) - but it is hard to see, short of a seismic shift in how data is shared and infrastructure is made available, how open-source/small players will be practically viable in a general-purpose sense.</p><p>I should also mention the dangers of overdoing the fine-tuning step. You can kind of see that with the constant flux of quality with well-known public models as they kept being fine-tuned for safety, etc. Fine-tune it long enough - and you will get mostly canned answers (an phenomenon known as catastrophic forgetting. Fine-tuning in general, without the access to the original data and training artifacts is tricky (for obvious basic reasons)  - that&#8217;s why releasing &#8220;open-source&#8221; model weights without releasing the original dataset is hardly &#8220;open&#8221;.</p><p>Now - what about AGI that people are so concerned about? Here, you must agree with Jedi Master Yann LeCun that auto-regressive generation (LLMs) is &#8220;an exponentially-divergent diffusion process, hence not controllable&#8221;. Hence, ultimately, a new architecture, combined with a practical way to efficiently learn and optimize to a world model, capable of hierarchical planning, is needed. So far, this is nowhere in sight.</p><p>In the meantime, we have to do the latter part by hand, via software (even though written with assistance from LLMs), where LLMs become like building blocks in the operating system.</p><p>What does this mean then for the ecosystem and for us as humans? Here are some possible outcomes:</p><h3>Jobs</h3><p>While everyone is talking about many jobs that will become unnecessary for humans to perform, a more important question is: what kind of jobs will be created instead?</p><p>It is already clear that &#8220;clean high-quality&#8221; data is the new oil. Human expertise that can be put to use to improve these data (in a broad sense, including expert fine-tuning) is necessarily going to be valuable. Hence, it is not a stretch to imagine people will be paid to instruct the models.</p><p>For example, a legal brief can be presented to several top lawyers for a review, or a code PR can be presented to top coders for review and be incorporated into the model. Tools and services that enable that will therefore be new machine tools for humans. Of course - AI will determine how much those humans will be paid, possibly in AI usage credits &#128578;</p><p>The inept regulation could become a monkey wrench for the ecosystem and the progress - yet, if done reasonably well, could take society to the next level.</p><h3>Model Wars</h3><p>It is hard to see open-source or second-tier LLMs becoming practical due to data and effort limitations, and it is not likely that hardware advancements will be overly impactful (considering the constant need to improve quality/performance/costs, and there&#8217;s still a long way to go there).</p><p>Furthermore - if, shall we say, a major player has a large, good-quality LLM with all the datasets and infrastructure - it is easy to downsize it and produce special-purpose small models almost at will.</p><p>The overlooked point here is that when one has the infrastructure to collect and manage data and feedback - that affords an interesting advantage at scale. Companies like <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Alexandr Wang&quot;,&quot;id&quot;:17270714,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://bucketeer-e05bbc84-baa3-437e-9518-adb32be77984.s3.amazonaws.com/public/images/a918fd05-6be2-4ddd-a4e6-d523d2e82ddd_400x400.jpeg&quot;,&quot;uuid&quot;:&quot;338bb503-741e-47de-8313-a04f0ff96eb5&quot;}" data-component-name="MentionToDOM"></span> &#8216;s <a href="https://scale.ai">Scale.ai</a> could be extremely well-positioned here in the long run.</p><h3>The Software</h3><p>Until radical new architectures come into play, a combination of LLM pipelines will need to be used to construct applications. Dynamic performance will be increasingly important as LLMs continue to be exceedingly resource-hungry, and multiple interactions are required to produce a good-quality outcome. What are the components of such frameworks:</p><p>- Inference pipeline execution/orchestration - both in streaming and batch fashion, focused on controlling response times, failure management, costs, and routing between different task-specific models.</p><p>- The routing logic needs to be a lot more sophisticated, controllable, and self-programmable.</p><p>- The usual conveniences for prompt management, context, history, tools, and RAG.</p><p>- Multi-modal/image/video side of the world will more commonly need to be incorporated.</p><p>- User feedback response and dataset management for fine-tuning.</p><p>- System performance evaluation/testing in a continuous fashion, especially when more advanced fine-tuning tricks are used (such LwF and EWC).</p><p>The bigger elephant in the room is Python. While super easy to use and convenient, it is arguably a JavaScript/Ruby/Perl-class language in terms of its (non)-safety, extreme error-proneness (that exponentially compounds with larger teams), and (lack of)-performance. So - either a JIT-enabled and a syntactic layer (akin to TypeScript) is going to need to be invented, or we are going to need a lot more multi-language frameworks to go mainstream.</p><p>One interesting evolution to observe will be how much of self-writing code capability will be incorporated natively into the frameworks (and their evolution) themselves to produce a truly LLM-first system.</p><p>Let&#8217;s see how this ages.</p><p>Happy New Year,</p><p>Ruslan</p><p></p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[My current workout routine]]></title><description><![CDATA[The Results from the latest DEXA scan:]]></description><link>https://www.equationblog.com/p/my-current-workout-routine</link><guid isPermaLink="false">https://www.equationblog.com/p/my-current-workout-routine</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sun, 24 Dec 2023 23:23:34 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>The Results from the latest <a href="https://health.ucdavis.edu/sports-medicine/resources/dxa-info">DEXA</a> scan</strong>:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!K4I9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!K4I9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png 424w, https://substackcdn.com/image/fetch/$s_!K4I9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png 848w, https://substackcdn.com/image/fetch/$s_!K4I9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png 1272w, https://substackcdn.com/image/fetch/$s_!K4I9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!K4I9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png" width="1073" height="208" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:208,&quot;width&quot;:1073,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:47726,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!K4I9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png 424w, https://substackcdn.com/image/fetch/$s_!K4I9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png 848w, https://substackcdn.com/image/fetch/$s_!K4I9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png 1272w, https://substackcdn.com/image/fetch/$s_!K4I9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5b7c39ac-70a3-41eb-973f-318de8adcd6b_1073x208.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!x82d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!x82d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png 424w, https://substackcdn.com/image/fetch/$s_!x82d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png 848w, https://substackcdn.com/image/fetch/$s_!x82d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png 1272w, https://substackcdn.com/image/fetch/$s_!x82d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!x82d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png" width="1067" height="315" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:315,&quot;width&quot;:1067,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:63596,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!x82d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png 424w, https://substackcdn.com/image/fetch/$s_!x82d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png 848w, https://substackcdn.com/image/fetch/$s_!x82d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png 1272w, https://substackcdn.com/image/fetch/$s_!x82d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0fe94afd-7aa2-4d0b-8049-320fd366d377_1067x315.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><p><em>&#8220;I don&#8217;t count my sit-ups; I only start counting when it starts hurting because they&#8217;re the only ones that count.&#8221; - Mike Tyson</em></p><p>In my earlier post, "<a href="https://open.substack.com/pub/ruslan/p/my-personal-daily-routine?r=jthe7&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcome=true">My Personal Daily Routine,</a>" I described in detail the types of exercise routines I use and the rationale behind those choices. Here, I am going to outline precisely my current exercise regimen - with a disclaimer that I do change things up from time to time.</p><p>This regimen is based on a blend of several guided <a href="https://www.teambeachbody.com/shop/us?locale=en_us&amp;referralprogramid=6WW&amp;trainername=AmoilaCesar">Beach Body</a> workouts. The choice of Beach Body is purely historical as I am a long time user since P90X came out. There are many other excellent choices.</p><p>In  "<a href="https://open.substack.com/pub/ruslan/p/my-personal-daily-routine?r=jthe7&amp;utm_campaign=post&amp;utm_medium=web&amp;showWelcome=true">My Personal Daily Routine,</a>"  I also go into detail as to why I prefer guided workouts - and yes, you can't listen to the All-In Pod while doing it, on purpose - unless you are doing Zone 2 training on an elliptical).</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>The overall length of the circuit is <strong>10 weeks</strong> (<strong>6 weeks</strong> of it is based on <a href="https://www.beachbodyondemand.com/programs/6-weeks-of-the-work/start-here?locale=en_US&amp;referralprogramid=6WW&amp;trainername=AmoilaCesar">6 Weeks of the Work</a> by Amoila Cesar and <strong>4 weeks</strong> is based on <a href="https://www.beachbodyondemand.com/programs/body-beast/start-here?locale=en_US&amp;referralprogramid=BE&amp;trainername=SagiKalev">Body Beast</a> by Sagi Kalev with several routines from <a href="https://www.beachbodyondemand.com/blog/tough-mudder-t-minus-30">T-30</a> by Hunter McIntyre and other programs along with running / elliptical workouts).</p><p>I exercise every day and don't have a formal rest day. That said I will throttle the intensity, when I feel I am not fully recovered (based both on subjective and objective metrics), such as sleep quality, resting HR, HRV, respiratory rate etc.).</p><p>As you have noticed, the workouts are front-loaded in the morning and are of considerable intensity. This is a compromise within my schedule - as I am able to burn enough calories to not worry about the exercise and energy expenditure for the rest of the day.</p><p>A quote from Mike Tyson is important. In order to get an adaptation, we need to push it. So whatever recommended reps and loads are - those are just for reference. You need to continue to push it up (provided you are fully recovered and the form is not compromised). For example, if you did the last set of bicep curls with 30 pounds, go up to 35 next time (even reps go down) - and if you don&#8217;t have 35s and just did 8 reps with 30s and feel like you can do more with good form - well do 2-3 more.</p><p>Every workout starts with a 10-15mins <a href="https://docs.google.com/document/d/1qMZIks3aOLaDd5jIPpInQbLcvFJsCt8s_d5yHH_JQkM/edit?usp=sharing">foam rolling routine</a> and I never skip it. Makes me feel like a teenager all day long.</p><p>The week for me starts on <strong>Sunday</strong>.</p><p><strong>The first 6 weeks</strong>:</p><p>The 6-weeks of the work program consists of 3 blocks of 2 weeks each comprising of a mix of 5 different functional routines followed by a 20-25 mins recovery routine on the 6th day with 1 rest day in the middle. As I mentioned earlier - I don&#8217;t do rest days, therefor I use a modified schedule.</p><p><strong>Sun</strong>, <strong>Mon</strong>, <strong>Tue,</strong> <strong>Wed</strong>, <strong>Thu</strong> is the appropriate sequential workout from 6-weeks of the work with a requisite recovery routine (called Range &amp; Repair) followed right after the last weekly workout on <strong>Thu</strong>.</p><p><strong>Fri</strong> - whatever I feel like a full body workout or an MMA-focused workout (could be as free form as practicing Krav Maga moves), optionally followed by a 30-mins Zone 2 on an elliptical (depending on how intense / long was the main workout).</p><p><strong>Sat</strong> - 50m elliptical Zone 2 workout followed by a T-30 chin-up routine, followed by T-30 Sheriff Abs, and ending with a short stretch.</p><p>In addition (after the main workout):</p><p>On <strong>Sun</strong> - a short interval run (or a simulated run using an elliptical) consisting of 3m warm-up jog, 9 intervals of 1 min on / 1 min off followed by a cool down, followed by a 5-min stretch.</p><p>On <strong>Wed</strong> - a long intervals run (or a simulated run using an elliptical) consisting of a 3m warm-up jog,  3 intervals of 5 min on / 2 min off  followed by a cooldown and a 5 min stretch.</p><p>On <strong>Mon</strong> for weeks 1, 3 and 5 (the 6-weeks of the work legs routine) - a T-30 pull-up routine, followed by a 10min Ab-routine from Insanity Max 30.</p><p>On <strong>Tue</strong> for weeks 2, 4 and 6 (6 weeks of the work Cardio &amp; Core) followed by a 30min Zone 2 on an elliptical. Cardio &amp; Core is the easiest workout in the series and the day overall feels like a rest day.</p><p>I am sharing the exact weight and rep ranges for me <a href="https://docs.google.com/document/d/15WEBWvQ095jwGwg1DS_jHmzMD2OrZr9DgTgV7P7VjFA/edit">here</a>. On T-30 pull / chin ups I normally go up to 12 reps on the ladder (unassisted). Of course this is with a maximally good form and with a minimum amount ego, making these weight / rep ranges quite challenging every time.</p><p><strong>The last 4 weeks</strong>:</p><p><strong>Sun</strong>: Beast-Up Chest/Shoulders/Triceps followed by T-30 short intervals.</p><p><strong>Mon</strong>: Beast up Legs, followed by a T-30 pull-up routine.</p><p><strong>Tue</strong>: Beast Cardio followed by Beast Abs optionally followed by a 30min Zone 2 on an elliptical.</p><p><strong>Wed</strong>: Build Back/Biceps optionally followed by a 30min Zone 2 on an elliptical. This is one of the toughest workouts in the series and Zone 2 here is truly optional.</p><p><strong>Thu</strong>: Bulk Shoulders (Thursday #1), Bulk Chest (Thursday #2), Bulk Arms (Thursday #2), Build Chest / Triceps (Thursday #4) or a similar auxiliary lift followed by T-30 long intervals.</p><p><strong>Fri</strong>: Beast Total Body or if feeling really good, <a href="https://www.beachbodyondemand.com/programs/a-week-of-hard-labor/start-here?locale=en_US&amp;referralprogramid=WHL&amp;trainername=SagiKalev">Week of Hard Labour</a>: Total Body (another program by Sagi Kalev) followed by a recovery routine.</p><p><strong>Sat</strong> - 50m elliptical Zone 2 followed by a T-30 chin-up routine, followed by a T-30 Sheriff Abs routine, and ending with a short stretch.</p><p>Unlike 6-weeks of the Work routines - these are done with heavy weights - and I mean heavy.</p><p>Why it may look like the last 4 weeks are more demanding - they are not. 6-weeks of the work type workouts focus on compound and power movements putting a lot of stress on a neuromuscular system, so these 4 weeks feel like a break.</p><p>All of this is not set in stone, and I do vary and experiment with things from time to time. When I have to shorten the workout, the ab routines get skipped as they are the least essential.</p><p>Enjoy!</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[The mind routine]]></title><description><![CDATA[&#8220;The grass is always greener on the side that is fertilized with bullshit&#8221; - Experiential wisdom]]></description><link>https://www.equationblog.com/p/the-mind-routine</link><guid isPermaLink="false">https://www.equationblog.com/p/the-mind-routine</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Thu, 23 Nov 2023 15:00:54 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!vSpF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em><strong>&#8220;The grass is always greener on the side that is fertilized with bullshit&#8221;</strong>&nbsp; - Experiential wisdom</em></p><p><strong>DISCLAIMER</strong></p><p><em>This content focuses on the subject of mental fitness and its impact on performance, reflecting solely my personal experiences. Please understand that the tools and methods mentioned are not intended as medical or any other kind of professional advice.</em></p><p>Everyone faces challenges in maintaining emotional balance under life's stresses, whether they're outwardly visible or not, and this invariably affects our performance in various professional and life situations. Looking back with hindsight, I can attribute most of my past mishaps to my failure in maintaining emotional balance, which led to less than optimal judgment, poor timing of actions, or ineffective communication with others.</p><p>It took me many years and several attempts to unlock some of these doors, and I've reached a point where I feel it might be useful to share my insights. I'm sharing these thoughts reluctantly, fully aware of our individual differences and the complexity and subjectivity of this topic.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to stay updated</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>The basics</h1><p><strong>&#8220;If you can&#8217;t tell what you desperately need, it&#8217;s probably sleep.&#8221;</strong>&nbsp;</p><p>&#8212;&nbsp;Kevin Kelly&nbsp;</p><p>Before trying to fine-tune our brain's software, it's crucial to ensure that our hardware is in the best possible shape. This involves optimizing our sleep, exercise, and nutrition. I have always tried to take care of these fundamentals, and while they aren't sufficient on their own, they certainly make the rest of the process much easier.</p><p>My personal daily routine, which mainly focuses on sleep optimization, exercise, and nutrition, is documented <a href="https://open.substack.com/pub/ruslan/p/my-personal-daily-routine?r=jthe7&amp;utm_campaign=post&amp;utm_medium=web">here</a> for reference. I recently made several updates there, so take a look.</p><h2>Physiological Stress management</h2><p>Obviously operation or programming of the brain doesn&#8217;t work well under stress. However, there are several well-supported strategies to mitigate this, some of which are deceptively simple yet effective:</p><ol><li><p>Being in nature</p></li><li><p>Breathing (there&#8217;re many methods), the main two I use:</p><ol><li><p>Box / Tactical breathing (called tactical because it is used in combat situations to calm the mind). I use a very simple iOS app called Breethe+ where you can set your own intervals. The classic box breathing would be 4s in 4s hold 4s out 4s hold (4-4-4-4 - the box). The exact inhale/exhale/hold intervals are actually empirically discoverable for each individual. Search for CO2 tolerance test.</p></li><li><p>Physiological sigh: double inhale - long exhale (audibly of possible, almost like going down the slide). The basic idea here is that when exhale is longer then inhale - it lowers adrenaline, the reverse raises it. This technique been used in martial arts since the beginning of time.</p></li></ol></li><li><p>Combining breathing with specific gaze patterns or tactile engagement with objects around you (such as toying with items on a desk during a meeting) can also be helpful.</p></li><li><p>A cold shower or plunge can temporarily increase dopamine levels. While I can confirm its benefits, I don&#8217;t particularly enjoy it. Nonetheless, some people swear by it.</p></li><li><p>Completing an unrelated (but necessary) small task, even if a distraction).</p></li></ol><p>These methods not only influence neurotransmitter activity and heart rate but also allow you to subtly co-opt your autonomic nervous system for better equilibrium.</p><p>Finally, it's worth mentioning the "letting go" technique from David Hawkins's <a href="https://www.amazon.com/Letting-Go-David-R-Hawkins-ebook/dp/B00EY818TQ/ref=tmm_kin_swatch_0?_encoding=UTF8&amp;qid=&amp;sr=">book</a>, which provides another approach to managing stress.</p><h2>Stress conditioning</h2><p>Martial arts training has the unique advantage of preparing you to perform under stress. One of the initial drills I recall vividly involved standing with closed eyes while teammates would strike from various directions. The hits weren't overly painful, but they were certainly noticeable, and the task was to just stand there and take it. It was quite an unsettling experience at first.</p><p>In a similar vein, both the Stoic practice of visualizing negative outcomes&#8212;referred to as "Premeditatio Malorum" (contemplation of future adversities)&#8212;and deliberately setting up challenging situations (fasting, cold showers again, United Economy, long useless meetings - well may be not those ones) are all good methods depending on what it is for you.</p><h2>Mental energy management</h2><p>I would also emphasize (from experience) that mental performance is closely tied to effective energy management. Possibly this is one of the most important aspects to pay attention to.</p><p>When your nervous system is drained&#8212;whether from excessive work-related stress, a series of unfortunate events, or intense physical exertion, especially from demanding full-body functional exercises&#8212;taking a break is often the best course of action. This could mean a change of pace, total disconnection, or simply some time to relax. Those of us with type-A tendencies often overlook the need for rest, but it's crucial to acknowledge that sometimes, the best thing to work on is allowing ourselves to recover. </p><p>Having a stress score from a wearable, such as an Oura ring, can be a good prompt. Not that you don't already know it, but empirical confirmation can serve as a useful behavioral tool to prompt action.</p><h1>Managing thoughts</h1><p><em><strong>&#8220;If You Think You're Enlightened, Go Spend a Week with Your Family&#8221;</strong></em><strong> </strong></p><p><strong>- </strong>Ram Dass</p><p>It would be more accurate to call this section 'Managing Thoughts and Behaviors.' The basic idea is that by consciously replacing undesirable thought patterns and behaviors with desirable ones, we can reprogram our thoughts and eventually our subconscious to be more effective. Cognitive Behavior Therapy (CBT) has been around for a while and exemplifies this concept.</p><p>Of course, it's easier said than done.</p><p>There are clear parallels with sports, particularly martial arts. It might take 30-40 repetitions to learn a new movement, but unlearning a faulty one can require up to ten times more. We can control physical movements relatively easily, practicing slowly, using a mirror, or working with a partner, and these movements aren't overly complex.</p><p>Now, apply this concept to the mind? That requires a lot of training iterations. This is why therapy can take years and yet yield only marginal results. The sheer number of repetitions and variations is overwhelming.</p><p>However, that doesn't mean we shouldn't try.</p><p>I've become a big fan of the Stoic toolkit. After several years of practice, I've found it easy to learn, apply, and quite effective. I like simple things.</p><p>This isn't to undermine CBT or its more advanced forms like DBT, which have their useful tools. Albert Ellis, who developed REBT (a precursor to CBT), was significantly influenced by Stoic philosophy.</p><p>I prefer going to the source. The Romans excelled in combat, enjoyment, and wealth creation. It's a simple, time-efficient framework that's easy to apply.</p><p>The first real book I read on stoicism (outside of school) and apart from occasional skimming through Meditations was <a href="https://www.amazon.com/Practicing-Stoic-Philosophical-Users-Manual-ebook/dp/B085H5R3JJ/ref=tmm_kin_swatch_0?_encoding=UTF8&amp;qid=1699665716&amp;sr=8-1">The Practicing Stoic </a>by Ward Fransworth and I can&#8217;t recommend it highly enough.</p><p>I quickly realized that you do have to actually practice it and then stumbled on another resource that I think is absolutely awesome: <a href="https://www.amazon.com/Handbook-New-Stoics-Week-Week/dp/1615195335/ref=tmm_other_meta_binding_swatch_0?_encoding=UTF8&amp;qid=1699746167&amp;sr=8-1">A handbook for New Stoics</a> by Massimo Pigliucci that is structured as 52-week long lesson journal for each week.</p><p>Finally, I read a page every morning from an excellent book <a href="https://www.amazon.com/Daily-Stoic-Meditations-Wisdom-Perseverance/dp/0735211736/ref=tmm_hrd_swatch_0?_encoding=UTF8&amp;qid=1699746132&amp;sr=8-1">The Daily Stoic</a> by Ryan Holiday.</p><p>I most certainly don&#8217;t feel qualified to explain stoicism, yet I feel I must offer a condensed (while incomplete) list of key concepts and practices to give you a sense on how I apply it:</p><p>To succinctly capture the essence of Stoic philosophy, it is perhaps best expressed in the words of Epictetus himself:</p><p><em>&#8220;There are three things in which a man ought to exercise himself who would be wise and good. The first concerns desires and aversions, that a man may not fail to get what he desires, and that he may not fall into what he does not desire. </em></p><p><em>The second concerns the movements (toward an object) and a movement from an object, and generally in doing what a man ought to do, that he may act according to order, to reason, and not carelessly. </em></p><p><em>The third thing concerns freedom from deception and rashness in judgement, and generally in concerns ascents. </em></p><p><em>Of these topics the chief and the most urgent of which relates to the effects; for an effect is produced in no other way than by a failing to obtain that which a man desires or falling into that which a man would wish to avoid. This is that which bring in perturbations, disorders, bad fortune, misfortunes, sorrows, lamentations, and envy; that which makes men envious and jealous; and by these causes we are unable to even listen to the percepts of reason.&#8221;</em></p><p>- Epictetus. Discourses 3, 21-3</p><p>In plain terms - Stoicism can be divided into 3 disciplines: The discipline of Desire (or wish), the discipline of Action (or impulse to act) and the disciplines of Accent (or judgements).</p><p>Key practices:</p><ol><li><p><strong>Dichotomy of Control</strong>: Recognizing the difference between what is in our control (our own thoughts and actions) and what is not (external events and others' actions), and focusing our energy only on the former. Once you think about it - you realize how little is truly in your control.</p></li><li><p><strong>Mindfulness (Prosoche)</strong>: Maintaining a constant awareness of our thoughts, actions, and feelings in the present moment to live intentionally and virtuously. Training in mindfulness mediation is extremely helpful as it allows to bring up some of the awareness you achieve while quite into a busy life. I actually doubt it&#8217;s otherwise possible purely on the intellectual level,  without a formal meditation training. Almost like you can intellectually think how to punch correctly, but unless you practiced under stress - good luck.</p></li><li><p><strong>Journaling (Meditation in Stoicism)</strong>: Reflecting on one's thoughts and actions through daily journaling, which serves as a tool for self-improvement and self-reflection. My sense is - everyone has to find his or her own path here. I cycled through many formats until I arrived at my own and I am still constantly tinkering with it from time to time depending on circumstances.  The point is to just do it.</p><p>Here&#8217;s my current format in Apple notes:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vSpF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vSpF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png 424w, https://substackcdn.com/image/fetch/$s_!vSpF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png 848w, https://substackcdn.com/image/fetch/$s_!vSpF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png 1272w, https://substackcdn.com/image/fetch/$s_!vSpF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vSpF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png" width="518" height="611" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:611,&quot;width&quot;:518,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:29582,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vSpF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png 424w, https://substackcdn.com/image/fetch/$s_!vSpF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png 848w, https://substackcdn.com/image/fetch/$s_!vSpF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png 1272w, https://substackcdn.com/image/fetch/$s_!vSpF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb37536e0-af95-4f8a-9c68-6354fb6ecadb_518x611.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p></li><li><p><strong>Premeditation of Adversity (Premeditatio Malorum)</strong>: Visualizing potential challenges and setbacks to prepare oneself emotionally and mentally for adversity and to diminish fear of the future. You can visualize both negative and positive outcomes (the latter is not technically Stoic - but could be useful for other purposes). Again - something you do in martial arts (i.e. visualizing an adverse scenario and the execution of various techniques in response). The important part here is not just to think about it - but try to engage all senses (smell, touch, taste) and feelings if possible.</p><p><strong>A word of caution (speaking from experience)</strong>: negative visualization is a double-edged sword, because we really don&#8217;t want to manifest bad things, just prepare for if/when they come. Especially when all senses are engaged it influences &#8220;reticular activating system" (RAS) - a network of neurons that is involved in discriminating incoming information into important versus background noise. I suspect this is where &#8220;make believe&#8221; problems tend to come from.</p></li><li><p><strong>The View from Above</strong>: Practicing a mental shift of perspective, imagining looking at one&#8217;s life from a cosmic perspective to gain a more objective and less egocentric view of events.</p></li><li><p><strong>Negative Visualization (Memento Mori)</strong>: Contemplating the impermanence of life (or just about everything else for that matter, possessions, relationships, money, etc) and the inevitability of death to value the present and live life more fully.</p></li><li><p><strong>Amor Fati</strong>: Embracing and loving fate, which involves accepting and even welcoming the events that occur, as part of the natural order of the universe.</p></li><li><p><strong>Virtue Ethics</strong>: Striving to cultivate personal virtues such as wisdom, courage, justice, and moderation in all aspects of life.</p></li><li><p><strong>Sympatheia</strong>: Acknowledging the interconnectedness of the universe and the kinship of all beings as part of a larger human community. This one has actually been genuinely hard for me.</p></li><li><p><strong>Voluntary Discomfort</strong>: Occasionally practicing self-denial or undergoing hardships to increase one's gratitude for what one has and to build resilience against future discomforts.</p></li></ol><p>Some of these practices may be easier or harder for you. But like Arnold would say - &#8220;you have to work on it&#8221;.</p><p>Trust this is enough of my pontifications on this subject. I would encourage you, if you find this helpful, to go to the original sources. Epictetus and Seneca's letters are great resources.</p><p>There is one more helpful tool in the toolbox I use - and this is from the book <a href="https://www.amazon.com/gp/product/B018SWBWX0/ref=ppx_yo_dt_b_d_asin_title_o07aud_?ie=UTF8&amp;psc=1">Loving what is</a> by Byron Katie. The 4 questions are:</p><ol><li><p><strong>Is it true?"</strong></p></li><li><p><strong>"Can you absolutely know that it's true?"</strong>: a bit of reductio ad absurdum, but it works &#8230;</p></li><li><p><strong>"How do you react, what happens, when you believe that thought?"</strong>.</p></li><li><p><strong>"Who would you be without the thought?"</strong></p></li></ol><p>The process also includes a "turnaround," which involves considering the opposite of the original belief.</p><p>While the framework is super simple, I&#8217;d recommend listening to the entire book on audio specifically for situational examples.</p><p>Hope this helps! Remember, these tools are for programming your neural nets and aren't very efficient because we're attempting to reprogram our behaviors using the conscious mind. Choose methods that feel natural for you, and avoid getting overly wrapped up &#8211; these are just tools in the tool box.</p><h1>Managing emotions</h1><p><em><strong>"No amount of anxiety makes any difference to anything that is going to happen.&#8221;</strong> </em></p><p><em>- Alan Watts</em></p><p>There's a long-running debate about what comes first&#8212;thoughts or emotions. They are likely interdependent, something we have all experienced. You feel something in your gut, you start thinking about it, and one thing feeds the other. This is the interplay between the conscious and subconscious. It highlights how incredibly difficult it is to control the subconscious with a conscious mind and who is really in charge most of the time.</p><p>There&#8217;s a school of thought (and quite persuasive at that) which argues that we have no free will. Either way, we're interested in affecting the programming, which is also ironic&#8212;because it, of course, implies that there's a programmer, which would not be consistent with a non-dualistic worldview. So here you have it&#8212;several cans of worms at once.</p><p>Back to the task at hand. The goal here is to bridge the gap between the emotions (the unconscious) and the thinking mind so that both can be complementary and in coherence as much as possible. After all, we do want to have access to the lower (at least our executive mind thinks they are lower) layers of the stack. You see the analogies in the corporate world, no doubt &#128578;</p><p>The first thing to understand is that we do need to go down the elevator&#8212;that means we have to get out of the Beta brainwave state and drop at least into Alpha, and ideally Theta. That's why I meditate in the morning, just right after waking up (as the brain is still transitioning out of Theta).</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DC9j!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DC9j!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DC9j!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DC9j!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DC9j!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DC9j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg" width="229" height="220" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:220,&quot;width&quot;:229,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;All About Brainwaves - Natural Medicine Center of Lakeland&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="All About Brainwaves - Natural Medicine Center of Lakeland" title="All About Brainwaves - Natural Medicine Center of Lakeland" srcset="https://substackcdn.com/image/fetch/$s_!DC9j!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg 424w, https://substackcdn.com/image/fetch/$s_!DC9j!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg 848w, https://substackcdn.com/image/fetch/$s_!DC9j!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!DC9j!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe14fd4e5-ef4a-49e6-ba77-f86e4a8a25dd_229x220.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>So, meditate first thing in the morning if at all possible, in a quiet place that allows you to settle and relax. That's what I do. Yes, that means absolutely no checking of email, Slack, or Teams (if that's your world). Twitter/X is Ok - just kidding, absolutely not.</p><p>There are methods, of course (such as the countdown technique), to settle the mind, but these can take considerably longer in the middle of a busy day. We are still all about efficiency here.</p><p>As I've mentioned before, I use the excellent Waking Up <a href="https://www.wakingup.com/">app</a> by Sam Harris. There are other options as well, and I&#8217;d say experiment to find what works for you. That said, in my personal opinion (having tried most of them), Waking Up is hands down the superior choice by a mile and is completely free of fluff. At some point, the app also becomes optional.</p><p>It took me years to start noticing the effects (probably at least 1.5 years). I am a slow learner. In fact, for the first six months, I constantly questioned myself&#8212;why am I doing this? It&#8217;s not working. Then, progressively, it clicked into place.</p><p>By far the first benefit is that you start noticing your thoughts and emotions, and that ability to observe becomes stronger and stronger and progressively transfers into daily life as well.</p><p>That noticing opens up additional contemplative opportunities:</p><ol><li><p>Simply an ability to examine thoughts and ideas without overwhelming ego interference</p></li><li><p>Use the power of visualization to set intentions</p></li><li><p>Start appreciating the present moment, improve attention and enhance gratitude, that I think does lead to a greater happiness (or lesser unhappiness)</p></li></ol><p>Try it.</p><p>What about neurofeedback wearables? I've tried several, and none worked for me. Perhaps individual brain circuitry varies, and for some people, external feedback is useful. For others, especially those with meditation training, it just adds an extra layer of delay and thus becomes counterproductive. Who knows?</p><h1>The Universe, Intuition and Intention</h1><p><em><strong>"You are not a drop in the ocean. You are the entire ocean, in a drop."</strong><br>&#8211; Rumi</em></p><p>The nature of the universe and our relationship to it has been a subject of contemplation by many brilliant philosophers and physicists since the dawn of time. Obviously, we cannot claim to know substantive answers&#8212;just as an ant cannot comprehend the life happening above it. Nonetheless, grounding ourselves in a model of understanding, however tenuous or circular it may be, can still be helpful as we navigate, act in, and act upon this world.</p><p>While there are countless books and writings on the subject, they remain largely theories or, more often than not, conjectures&#8212;most of which are inherently <strong>non-falsifiable</strong>. This means they cannot be definitively proven or disproven, placing them in the realm of philosophical exploration rather than empirical science. As such, they must be approached with a healthy dose of skepticism and an understanding of their speculative nature.</p><p>That said, when we look across the breadth of world philosophies and spiritual traditions, unmistakable patterns emerge&#8212;seemingly independently&#8212;and some of these recurring ideas become difficult to dismiss outright. While they may not meet the rigor of falsifiability, their persistence across time and cultures suggests they offer valuable lenses through which to view our experience of reality.</p><p>I&#8217;ll recommend a few resources that I find practical and thought-provoking:</p><ul><li><p><strong>Two books by Rizwan Virk:</strong><br><a href="https://www.amazon.com/Simulation-Hypothesis-Computer-Scientist-Quantum-ebook/dp/B0DKMC5XFY/?_encoding=UTF8&amp;pd_rd_w=MZ1hY&amp;content-id=amzn1.sym.05575cf6-d484-437c-b7e0-42887775cf30&amp;pf_rd_p=05575cf6-d484-437c-b7e0-42887775cf30&amp;pf_rd_r=134-7919916-5052361&amp;pd_rd_wg=GKF4h&amp;pd_rd_r=f5506289-30f5-44c0-90b8-32799e7753bb&amp;ref_=aufs_ap_sc_dsk">Simulation Hypothesis</a> and <a href="https://www.amazon.com/Simulated-Multiverse-Scientist-Simulation-Hypothesis-ebook/dp/B08XFR749T?ref_=ast_author_mpb">The Simulated Multiverse</a>. The central question is: Are we Player Characters (PCs) or Non-Player Characters (NPCs)? </p></li><li><p><a href="https://a.co/d/eeutIjN">A Brief History of Intelligence</a>: Evolution, AI, and the Five Breakthroughs That Made Our Brains by Max Bennet offering a non-metaphysical view on development of human mind </p></li><li><p><strong>Vadim Zeland's </strong><a href="https://www.amazon.com/Reality-transurfing-Steps-Vadim-Zeland-ebook/dp/B00PY8ICSA/ref=sr_1_1?crid=1W27EMCH7ED4P&amp;dib=eyJ2IjoiMSJ9.9AC1pYVYk3IOmPqqH9u0Lvbk23mWymG3Ltz899TdwrxcTTUeE9FOI3fYZ6jFbBt9UG9N8J_ExH2IbOaKVbBGfN_i5P5qLN13Y75loB0u_lRLCS7gKU2xN-cLG7LoPpzZnR5vQrelEmM44PjyGE6mg1GCCLrL-1r7VNTD8fHgAGhK5Ab18aBmAfGr9zRp2cSezjxG82-yQ-xcK6wNyA95rriPtedhdFDF5HwE5k8RNM4.lkvG7NVTkN-DILrOFUjFomgp66WARiHtRFMm1DqalZI&amp;dib_tag=se&amp;keywords=transurfing&amp;qid=1732754109&amp;s=digital-text&amp;sprefix=transsurfing%2Cdigital-text%2C177&amp;sr=1-1">Reality transurfing</a><strong> series:</strong><br>In particular, <a href="https://www.amazon.com/dp/B07KGDQHPB/ref=mes-dp?_encoding=UTF8&amp;pd_rd_w=rFBvK&amp;content-id=amzn1.sym.7d2923e8-7496-46a5-862d-8ef28e908025&amp;pf_rd_p=7d2923e8-7496-46a5-862d-8ef28e908025&amp;pf_rd_r=Y79KYQ9X4Z3HMMM8TBPM&amp;pd_rd_wg=BLjNJ&amp;pd_rd_r=1f0d7db5-60f8-44ea-ac18-1c9daae46404">Transurfing in 78 Days &#8212; A Practical Course in Creating Your Own Reality</a>. This book is similar in format to <em>The Daily Stoic</em>, offering structured daily reflections. At its central is the idea of viewing the world as a mirror&#8212;a concept drawn from many traditions but most closely aligned with Dao Qing, emphasizing harmony and the dynamic interplay between inner intentions and outer reality, a perspective that Zeland&#8217;s work echoes.</p></li></ul><h2>Intuition</h2><p>We all know what intuition is&#8212;the simplest example being a &#8220;gut feeling.&#8221; You can debate whether it comes from within (as some sort of autonomic AI within us) or from external influences&#8212;or perhaps a combination of both. What&#8217;s undeniable is that intuition can be profoundly helpful. There are many who hone this skill, learning to quiet the noise in their minds to reliably tap into their intuitive sense.</p><p>The good news is that there are techniques, many of them rooted in spiritual traditions, that can sharpen your ability to listen to your internal voice. A resource I particularly like is <em><a href="https://www.amazon.com/Intuition-Bible-Into-Inner-Wisdom-ebook/dp/B0CW1MSZ2J/ref=sr_1_1?crid=MDOWD407XW5Y&amp;dib=eyJ2IjoiMSJ9.4wjR4bbHuN-G31avYnzZsoz8mU4K_2sUjqLBnxfELqa6UsjrtMvKPRW5-0ddQ6VfLhgn_ORqKnPh-gjXTBKm3q6r9Y3YKy8eUmRtkfIn3fdaEGPPFMq6kWo7u_VGPAcyyBh64BgaOqtqa483FroGogSHM5vxL4EzqT1Z7CrCIDl4Qf-rQXTXLy61tpK0G8GjVbD212kKYlQ2-nCIOyspNhi6vTM9iGwSZky9vnlNCXY.AeN3aXn8ze9IVoZ1X-q6w5IgsaVxkw-JuzzpuHVNTwg&amp;dib_tag=se&amp;keywords=the+intuition+bible&amp;qid=1732755940&amp;s=digital-text&amp;sprefix=the+intuition+bible%2Cdigital-text%2C184&amp;sr=1-1">The Intuition Bible: How and Why to Tap Into Your Inner Wisdom</a></em> by Happy Ali. It offers a solid list of techniques you can experiment with yourself.</p><h2>Intention</h2><p>Our intentions are the engines that drive us to act. However, the real challenge lies in maintaining sharp focus on our goals, especially in the face of distractions, a racing mind, or low energy. Frederick Dodson&#8217;s <a href="https://www.amazon.com/gp/product/B01JCBIHM0/ref=ppx_yo_dt_b_d_asin_title_351_o02?ie=UTF8&amp;psc=1">Reality Creation and Manifestation</a> outlines a number of focusing exercises that I&#8217;ve found practical. These are well-known techniques that can likely be found elsewhere, but Dodson presents them in a clear, actionable format.</p><div><hr></div><p>Final Thoughts</p><p>Some of the books I&#8217;ve referenced are less than scientific, and they contain ideas and assertions you may not agree with&#8212;or even find contradictory. That&#8217;s okay. You don&#8217;t have to fully align with an author&#8217;s worldview to benefit from their ideas or techniques. Take what resonates, apply it where it&#8217;s useful, and leave the rest.</p><h1>Navigating the path</h1><p><em><strong>"It is not your responsibility to finish the work [of perfecting the world], but you are not free to desist from it either."</strong></em></p><p><em>-l Rabbi Tarfon, Pirkei Avot 2:21</em></p><p>It is self-evident that we, as humans, have an inherent motivation to create, contribute to, and improve the world around us. We know when we are succeeding on that path, and we equally recognize when we are not.</p><p>So, how do we figure it out? The first sign is usually a feeling of being off course, which we recognize after some time. The natural&#8212;and correct&#8212;evolutionary inclination is to change course. But how do we optimize this search and minimize the chances of being misled by the impulses of the ego?</p><h2>3MQ method</h2><p>It represents the three most important questions I learned from Vishen Lakhiani&#8217;s work. The questions are:</p><ol><li><p>What do I want to experience?</p></li><li><p>How do I want to grow?</p></li><li><p>What do I want to contribute?</p></li></ol><p>The format is a one-pager, like <a href="https://blog.mindvalley.com/wp-content/uploads/2018/01/3-most-important-questions-1.pdf?_gl=1*pv1h5y*_ga*NDI3NjExNzYyLjE2OTAxMzUxOTM.*_ga_XY6ERPH200*MTY5OTkwNDg2NS4xNi4wLjE2OTk5MDQ4NzAuNTUuMC4w">this</a> (a Google doc is fine). The answers are, of course, something to be contemplated, ideally several times over a period.</p><p>In all these kinds of contemplations (just like with visualizations), it's important to engage all senses when describing experiences. Literally feel yourself speaking on that stage, hear the audience, describe the shapes of the lights, the temperature in the room, etc.</p><h2>The life timeline method</h2><p>The second method I've found useful is to map out my life's timeline in manageable increments (like 1-2 years), akin to an OPS post-mortem. I jot down important points about these periods and try to identify Kensho and Satori moments.</p><p>In Zen Buddhism, "Kensho" means "seeing one's true nature" and refers to a sudden insight or awakening. "Satori" is similar but considered more profound and enduring. Kensho is often characterized as enlightenment gained through adversity or suffering, whereas Satori is seen as a deeper, more comprehensive awakening or enlightenment. (this is a very non-nuanced explanation&#8212;my apologies). As we examine the timeline, these moments and experiences should become self-evident.</p><p>Finally, an excellent book, "<a href="https://www.amazon.com/Second-Mountain-David-Brooks-ebook/dp/B07DT1BD63/ref=tmm_kin_swatch_0?_encoding=UTF8&amp;qid=&amp;sr=">The Second Mountain</a>" by David Brooks, which was recommended to me, was also quite helpful.</p><p>This discussion would not be complete without talking about what everyone is talking about these days - and that is what&#8217;s the place in this exploration for the use of entheogenic aids.</p><p>Obviously the promise here is bypassing the conscious mind and - back to the AI analogies instead of prompt engineering do the full fine-tuning of the model, with all the requisite risks of course.</p><p>This is not something one should attempt casually and only do it a jurisdiction and within the framework where it is legal and in a safe set and setting, with the appropriate medical evaluation (there&#8217;re several serious counter-indications), supervision and pre/post integration. This type of exploration should be taken very seriously, otherwise it will not only not work, but can cause serious harm.</p><p>Below is a compilation of resources that might be helpful:</p><ol><li><p>The first book everyone reads is <a href="https://www.amazon.com/How-to-Change-Your-Mind-audiobook/dp/B07B1V3RF5/ref=sr_1_1?crid=UEAJ6SEE5J92&amp;keywords=michael+pollan+books&amp;qid=1699914553&amp;sprefix=michael+pol%2Caps%2C169&amp;sr=8-1">How to Change your mind</a> by Michael Pollan</p></li><li><p>Then of course there&#8217;s <a href="https://www.amazon.com/Food-of-Gods-Terence-McKenna-audiobook/dp/B009IBRSJI/ref=sr_1_1?crid=CIWVVKRWWL0I&amp;keywords=food+of+the+gods&amp;qid=1700696699&amp;s=audible&amp;sprefix=food+of+the+god%2Caudible%2C130&amp;sr=1-1">Food of the Gods</a> by Terence McKenna</p></li><li><p>A more practical collection of research and clinical experiences mostly from the before the ban era by <a href="https://www.amazon.com/Psychedelic-Explorers-Guide-Therapeutic-Journeys/dp/B09ZD2HCQY/ref=sr_1_1?crid=3DRY2G4DNXLE9&amp;keywords=explorers+guide+psychedelics&amp;qid=1699914772&amp;s=audible&amp;sprefix=exploerers+guide+psycodelic%2Caudible%2C121&amp;sr=1-1">James Fadiman</a></p></li><li><p>There&#8217;s a lot of emergent research that may be worth paying attention to, a good destination for that is <a href="https://maps.org">MAPS</a></p></li><li><p>Should you want to see what pushing the envelope looks like: <a href="https://www.amazon.com/gp/product/B0848SBF1M/ref=ppx_yo_dt_b_d_asin_title_129_o05aud_?ie=UTF8&amp;psc=1">LSD and the Mind of the Universe</a>: Diamonds from Heaven by Christopher M, Bache.</p></li></ol><p>In the vein of ancient Greek philosophers, the specifics of such journeys tend to remain unspoken, varying widely from individual to individual. Nevertheless, here's what one might generally expect:</p><ol><li><p>Allocating around 5 weeks for a well-structured process seems to be a common practice:</p><ol><li><p>The first two weeks are for preparation. It's usually recommended to continue with regular work but to avoid unusually stressful situations. Abstaining from alcohol is required, along with maintaining a healthy diet (which one should do anyway). The real challenge is the need to forego coffee.</p></li><li><p>A week of actual retreat.</p></li><li><p>Following this, two weeks of post-integration are necessary. With increased brain neuroplasticity during this time, minimizing stress and focusing on contemplation is more crucial.</p></li></ol></li><li><p>Approaching this experience with intention, yet being open to new insights, is often advised. A three-session protocol within a week has been known to be effective for allowing these realizations to emerge and facilitating thorough exploration.</p></li><li><p>It is often noted that the future remains a mystery, filled with myriad possibilities. This resonates with the belief that the Universe&#8212;or any higher power one subscribes to&#8212;is in a perpetual state of exploration, with each person acting as an explorer.</p></li><li><p>The dissolution of ego, acute focus, and a clear view of one&#8217;s life history enable an unfiltered perception of reality, often described as liberating from the usual mental noise. This can be challenging at times, but many report a clarity where excuses and justifications simply vanish, leading to an acceptance of reality as it is. In such moments, the relevance of preparation is frequently recognized in hindsight.</p></li><li><p>The path often becomes clearer during this journey&#8212;gaining an understanding of one's purpose, the reasons behind certain life events (not in a deterministic sense, but in a way that renders the unfolding of events logical, given the inputs and circumstances), and the potential paths ahead.</p></li><li><p>The high degree of focus may persist for several weeks.</p></li><li><p>Regular meditation practices tend to improve significantly, sometimes enabling brief re-experiences of the journey&#8217;s state.</p></li><li><p>A noticeable increase in kindness towards others, compared to one's usual behavior, is often observed.</p></li><li><p>It is suggested to keep a notebook close for reflections during and immediately after the experience. While the memories may remain intact, capturing the clarity of specific insights could be valuable.</p></li></ol><p>The final question here is&#8212;can this be achieved without pharmacology? While the answer is obviously yes, as centuries of spiritual traditions confirm it, the practicality can be a challenge.</p><p>Some options include long meditation retreats, holotropic breathing sessions, and insights could also result from Kensho-type moments. However, none of these are reliable, time-efficient, or (in the case of Kensho moments) necessarily desirable (or predictable).</p><p>Hope this helps.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to stay updated</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[My coffee fast experiment]]></title><description><![CDATA[I love coffee.]]></description><link>https://www.equationblog.com/p/a-coffee-fast-experiment</link><guid isPermaLink="false">https://www.equationblog.com/p/a-coffee-fast-experiment</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sun, 24 Sep 2023 14:15:03 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!tiAx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>I love coffee. I always have.</strong> The idea of taking a break from it never originated from me, unthinkable really. But, I found myself in this situation when I signed up for a retreat in Colorado. Part of the preparation for this retreat was to follow several restrictions for two weeks. This included abstaining from coffee, something I learned about only after signing up.</p><p>I'd certainly heard about coffee withdrawal syndrome from others, and that it's not pleasant. Given this, I wasn&#8217;t eager to experience it. However, being the biohacker that I am, I viewed this as an opportunity for a self-study.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h3><strong>My Normal Coffee Routine</strong></h3><p>As I've mentioned, I love coffee. I drink it black, no sugar, often a robust Americano made from 2 double espresso shots. I savor its taste, akin to the way a wine connoisseur would. My go-to is <a href="https://www.intelligentsia.com/products/black-cat-classic-espresso">Black Cat</a> Espresso, but I do enjoy Taiwanese coffee when I can find it.</p><p>Here's a breakdown:</p><ul><li><p><a href="https://www.amazon.com/gp/product/B00QYZ6MLG/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Legion</a> Pre-Workout drink (1/2 serving): 175 mg (early AM)</p></li><li><p>Americano (2x double espresso): 280mg (for breakfast, using an average double espresso estimate of 120-160 mg)</p></li></ul><p>On occasion, I might have another coffee after lunch. But these instances are rarer these days due to its impact on my sleep quality, not in terms of how long or how quickly I sleep, but the depth and quality of it, as shown by my <a href="https://ouraring.com/?c_correlation_id=09300e5c75e54fd291cb36c78486665a&amp;c_tenant_id=src_1kYsAcdpfzbZ8UlNLYht1RPg3m2&amp;g_acctid=553-919-5922&amp;g_adgroupid=&amp;g_adid=&amp;g_adtype=none&amp;g_campaign=pmax_prospecting-retargeting_integrated_allgeos_purchase_english&amp;g_campaignid=17714554930&amp;g_keyword=&amp;g_keywordid=&amp;g_network=x&amp;utm_campaign=pmax_prospecting-retargeting_integrated_allgeos_purchase_english&amp;utm_content=sleeplab_prospecting_allgeos_en_static_video&amp;utm_medium=cpc&amp;utm_source=google&amp;utm_source=google_pmax&amp;gclid=CjwKCAjwmbqoBhAgEiwACIjzEPlf0Bjg46n7KF_vmRTVZhLHRVPG184yEcaGQRngBjJLvVxM5vEK5xoCwksQAvD_BwE">Oura</a> ring data.</p><p><strong>The Ramp-Down Protocol</strong></p><p>To cut back on caffeine, I followed these steps:</p><ul><li><p><strong>4 weeks</strong>: No second coffee during the day.</p></li><li><p><strong>3 weeks</strong>: Substituted my Americano with a decaf version (still Black Cat Espresso) and no coffee after lunch.</p></li><li><p><strong>2 weeks</strong>: Shifted to a non-caffeinated version of the <a href="https://www.amazon.com/gp/product/B07KFLM35W/ref=ppx_yo_dt_b_search_asin_title?ie=UTF8&amp;psc=1">Legion</a> drink.</p></li></ul><p>The first and second steps, particularly the switch from caffeinated to decaf Americano, were smooth. Completely cutting off caffeine led to a mild discomfort around day two, but it wasn&#8217;t as dreadful as I'd anticipated.</p><h3><strong>Subjective Experience</strong></h3><p>The absence of coffee brought noticeable changes. My workouts, especially in power lifting moves, weren&#8217;t as good (quite lousy actually to be frank). My thinking felt slightly slower.</p><p>However, the days did have a notable calming effect. My focus sharpened and my ability to get into the flow of tasks improved.</p><p>But, even when using the same brand, decaf didn't quite hit the mark in taste compared to its caffeinated counterpart.</p><h3><strong>Objective Data</strong></h3><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tiAx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tiAx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png 424w, https://substackcdn.com/image/fetch/$s_!tiAx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png 848w, https://substackcdn.com/image/fetch/$s_!tiAx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png 1272w, https://substackcdn.com/image/fetch/$s_!tiAx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tiAx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png" width="1456" height="1218" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1218,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:325255,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!tiAx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png 424w, https://substackcdn.com/image/fetch/$s_!tiAx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png 848w, https://substackcdn.com/image/fetch/$s_!tiAx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png 1272w, https://substackcdn.com/image/fetch/$s_!tiAx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feef2ec64-1e61-40e1-a9e0-490d6d9a37a6_2180x1823.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p></p><div class="image-gallery-embed" data-attrs="{&quot;gallery&quot;:{&quot;images&quot;:[{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ee104b76-418f-42dd-97db-dda66d7d80a2_603x1304.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/721dc951-7a7d-4630-b387-a98b577de47c_603x1304.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/81615c8e-1168-4df2-9d14-dd87f3f01c65_603x1304.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9ed722f7-df88-4f67-b807-33483698e28a_603x1304.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2038a64a-d761-4c5b-914f-1d5bf1f0748d_603x1304.jpeg&quot;},{&quot;type&quot;:&quot;image/jpeg&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2dc66db0-944f-4931-81fd-695ab0bb0983_603x1304.jpeg&quot;},{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d9e86a8c-1e91-4dd8-8434-ab8e13cac60c_1284x2778.png&quot;}],&quot;caption&quot;:&quot;&quot;,&quot;alt&quot;:&quot;&quot;,&quot;staticGalleryImage&quot;:{&quot;type&quot;:&quot;image/png&quot;,&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e0d0aac4-b26e-40f9-b23f-f15f122ce13c_1456x1946.png&quot;}},&quot;isEditorNode&quot;:true}"></div><p></p><p>The data is somewhat hard to interpret and comes off as noisy; the experiment likely needed more time for clearer results. The Oura ring, though good in comparison to other devices, isn't entirely accurate, particularly regarding sleep stages. Their recent statistics indicate a 79% agreement with polysomnography, which isn't as impressive as one might hope.</p><p>With those disclaimers in mind, you can see a discernable improvement in sleep quality and especially deep sleep. Interestingly, sleep latency didn't seem to be affected, suggesting that mechanisms beyond just the blockage of adenosine receptors, like hormonal regulation, might be at play.</p><p>My blood pressure exhibited a significant drop, which could be a potential reason for the decrease in workout performance. It's worth noting that I measure my blood pressure using the Omron 10 Series monitor,  in the mornings. Since I don't strictly adhere to guidelines like remaining still  for a while before taking measurements (in the interest of time),  the absolute readings are somewhat higher (by 5-8 points) than a precise clinical measurement.</p><h3><strong>Ramp Back Up Protocol</strong></h3><p>Getting back on caffeine mirrored the wind-down:</p><ul><li><p>First up was the caffeinated Legion pre-workout.</p></li><li><p>A week later, my regular Americano returned.</p></li></ul><p>With the reintroduction, I observed:</p><ul><li><p>A bounce-back in workout quality.</p></li><li><p>Faster thinking.</p></li><li><p>A delightful Americano experience once again.</p></li></ul><p>My blood pressure saw a slight uptick but remained below my initial readings. The sense of calm I experienced earlier started to diminish, as did some of the increased focus. While sleep quality did decline a bit, it was surprisingly still better overall.</p><p>And all this was after the retreat, which could have influenced some of these observations.</p><h3><strong>Parting Thoughts</strong></h3><p>The big takeaway? Living without coffee is feasible. The effects of withdrawal are manageable. Yet, there are trade-offs:</p><ul><li><p><strong>Sleep</strong>: Coffee, even just in the morning, does affect it.</p></li><li><p><strong>Focus</strong>: There's a definite improvement without caffeine.</p></li><li><p><strong>Mental Speed &amp; Workouts</strong>: They thrive with caffeine.</p></li><li><p><strong>Decaf</strong>: Doesn&#8217;t taste quite the same.</p></li></ul><p>To wrap up: Would I quit coffee entirely? Hell no. But considering breaks based on my observations? It's a possibility.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[Team performance: Meditations on Performance Reviews]]></title><description><![CDATA[Firing up my time machine, I can vividly recall my first averse reaction to the concept of performance reviews as if it were yesterday.]]></description><link>https://www.equationblog.com/p/team-performance-meditations-on-performance</link><guid isPermaLink="false">https://www.equationblog.com/p/team-performance-meditations-on-performance</guid><dc:creator><![CDATA[Ruslan Belkin]]></dc:creator><pubDate>Sun, 07 May 2023 14:00:24 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!-MQk!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0187b1d4-c470-4ffa-a463-efc198a6864f_1280x1280.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Firing up my time machine, I can vividly recall my first averse reaction to the concept of performance reviews as if it were yesterday. It was a sunny day around noon during an all-managers meeting in the cafeteria on the second floor of a super-fast-growing company in the late 2000s. We had recently hired a new CEO, and the topic of the meeting was an introduction of a company-wide performance review process. Now that I think about it, I am surprised that it took so long into the life of the company for the concept to be introduced. But admittedly, we had other things to focus on.</p><p>Until that time, as an individual contributor in previous companies, I had experienced the performance review process, but it never registered. Like most people, I would look at the compensation adjustment line and the rating, ignore all else, and move on.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>This time, as an engineering leader, the full rollout of the performance review process, complete with 360s, ratings, ratios, calibration sessions, and the SuccessFactors software, caused some visceral feelings. Here are some things that went through my mind in that meeting:</p><ol><li><p>Time commitment - while it was being sold (as all new processes are sold that way) as &#8220;lightweight,&#8221; even then, it was clear to everyone that it wasn&#8217;t going to be. Mind you, being understaffed and overworked wasn't even beginning to describe the degree of burnout we then experienced. After all, the 9/9/6 schedule is a Silicon Valley invention, and we had a 9/9/6 Plus (we loved it though in all fairness).</p></li><li><p>We had great and highly complementary teams in terms of skills - it was a head-scratcher for me how I would rate people in that situation where there were no obvious underperformers, and everyone was giving it their all.</p></li><li><p>I was most worried about the downsides of the process. Hiring was extremely difficult in those days, and losing a key engineer or two because of an artificial rating almost guaranteed a major adverse impact since we had single points of failure just about everywhere.</p></li></ol><p>Being rather naive at the time, I shared my reservations in that meeting and asked why we needed to focus on performance reviews. I still remember the answer that is worth pondering on: &#8220;Because it is Best Practice.&#8221;</p><p>Let this sink in - "We are going to be implementing X because it is a Best Practice.&#8221; Take your favorite pick where we heard a similar explanation or a variant of it. To this day, I keep failing to find any evidence of this practice being positive at all, let alone &#8220;best,&#8221; from whatever scant research there has been.</p><p>Perhaps someone has done or is going to do a well-designed study (even a retrospective one) on the topic - if you know of one, please point me to it. Here's what I have observed in my experience across several companies (as well as heard from other managers and executives):</p><ol><li><p>There seems to be an uptick of regrettable departures (people we don&#8217;t want to quit) right after a performance review cycle.</p></li><li><p>Performance calibration meetings are one of the more toxic events in the company, often preceded by a massive amount of horse-trading among managers and executives and a lasting ill will afterward.</p></li><li><p>Compensation adjustments following the reviews are always below what people expect and are (on average) a negative retention driver (if you want to retain someone for comp reasons, you better adjust the comp out of cycle).</p></li><li><p>The amount of time spent on the process is material in terms of delivery and focus, and of course, those magically fall into the end of the quarter. I would estimate at least 1 full month of productivity lost.</p></li><li><p>No one likes the performance review process (not to be confused with what you hear from HR surveys - when people say they want feedback through the review process - they want a raise or a promotion).</p><p></p></li></ol><p>Now, to be dialectical about this problem, let's look at it from the point of view of a CEO. Why would a CEO need to have a performance review process within the company? There are several good reasons:</p><ol><li><p>We need to understand how to manage compensation adjustments within the company, hence we need to understand the performance rankings.</p></li><li><p>Similarly, we need to make sure we manage out underperformers on a regular cadence.</p></li><li><p>We may have managers within the company who are new or inexperienced and not doing performance management.</p></li><li><p>We want to put some objectivity into the data - and not rely on managers only, hence the 360 feedback.</p></li><li><p>We want to have a process/documentation to manage performance for compliance reasons.</p></li><li><p>As the company grows, the individual performance matters less than elevating an average, hence we&#8217;re less concerned with the impact of regrettable attrition - but rather with managing the average performance of the organization (the usual curse of a big corporation).</p></li><li><p>We just hired a new Chief People Officer &#128578;</p></li></ol><p>Any process that involves people is an attempt to program people's behavior towards specific goals. It works best when groups are small and goals are clear. When groups are small, you don't need performance reviews. In a bigger company, I would argue that neither is true, nor are objectives clear (a fair distribution of a token comp adjustment, an overdue promotion, or identification of already obvious bottom performers are hardly that).</p><p>So, if it is about programming organizational (and individual) behavior, we are going to need to approach it from first principles and in more subtle ways.</p><p>To be continued.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.equationblog.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading The Equation! Subscribe for free to receive new posts.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>