<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Cloud - Azalio</title>
	<atom:link href="https://www.azalio.io/category/cloud/feed/" rel="self" type="application/rss+xml" />
	<link>https://www.azalio.io</link>
	<description>Your technology partner</description>
	<lastBuildDate>Wed, 17 Jun 2026 10:59:23 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.6.5</generator>

<image>
	<url>https://www.azalio.io/wp-content/uploads/2021/12/cropped-logo@3x-32x32.png</url>
	<title>Cloud - Azalio</title>
	<link>https://www.azalio.io</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Z.ai pitches GLM-5.2 for long-running software engineering tasks</title>
		<link>https://www.azalio.io/z-ai-pitches-glm-5-2-for-long-running-software-engineering-tasks/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 10:59:23 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/z-ai-pitches-glm-5-2-for-long-running-software-engineering-tasks/</guid>

					<description><![CDATA[<p>Z.ai has released GLM-5.2, an MIT-licensed open-source AI model designed for long-running software engineering tasks, as the Chinese company seeks to challenge proprietary coding models on cost and performance. The company said GLM-5.2 ranked just behind Anthropic’s Claude Opus 4.8 on FrontierSWE, a long-horizon coding benchmark, trailing it by 1%. Z.ai said the model also [&#8230;]</p>
<p>The post <a href="https://www.azalio.io/z-ai-pitches-glm-5-2-for-long-running-software-engineering-tasks/">Z.ai pitches GLM-5.2 for long-running software engineering tasks</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>
<div id="remove_no_follow">
<div class="grid grid--cols-10@md grid--cols-8@lg article-column">
<div class="col-12 col-10@md col-6@lg col-start-3@lg">
<div class="article-column__content">
<section class="wp-block-bigbite-multi-title">
<div class="container"></div>
</section>
<p>Z.ai has released GLM-5.2, an MIT-licensed open-source AI model designed for long-running software engineering tasks, as the Chinese company seeks to challenge proprietary coding models on cost and performance.</p>
<p>The company said GLM-5.2 ranked just behind Anthropic’s Claude Opus 4.8 on FrontierSWE, a long-horizon coding benchmark, trailing it by 1%. Z.ai said the model also edged out OpenAI’s GPT-5.5 by 1%.</p>
<p>Z.ai said GLM-5.2 supports a one million-token context window with up to 131,072 output tokens, positioning it for agentic coding workflows that require reasoning across large codebases.</p>
<p>The company is also making an efficiency argument. It said GLM-5.2 uses a technique called IndexShare, which reduces per-token compute by 2.9 times at a one million-token context length. It also said changes to the model’s multi-token prediction layer increased the acceptance length for speculative decoding by up to 20%.</p>
<p>The changes are aimed at a practical problem for developers: long-context coding agents can be expensive to run when they are asked to work across large repositories.</p>
<h2 class="wp-block-heading" id="enterprise-appeal">Enterprise appeal</h2>
<p>GLM-5.2’s clearest appeal is that it pairs stronger coding capabilities with the cost advantages of an open-source model. But capability alone will not be enough to make it a credible alternative.</p>
<p>“Western enterprises will want independent benchmark validation, successful deployments at global enterprises, strong security and governance controls, and long-term support commitments,” said <a href="https://pareekh.com/" target="_blank" rel="noreferrer noopener">Pareekh Jain</a>, CEO of Pareekh Consulting.</p>
<p>Jain said the fastest route to enterprise credibility would be hosting by a major cloud provider like AWS. That would allow customers to use the model under standard enterprise terms, with service-level commitments and compliance certifications.</p>
<p><a href="https://www.linkedin.com/in/tulikasheel/" target="_blank" rel="noreferrer noopener">Tulika Sheel</a>, senior VP at Kadence International, said GLM-5.2 would also need to prove it can operate as a stable enterprise product.</p>
<p>“Demonstrated success in real-world deployments and transparent governance will be just as important as benchmark scores,” Sheel said.</p>
<p>The performance and cost claims will also need to hold up against established models.</p>
<p>“Enterprise leaders generally consider two major factors when evaluating new models,” said <a href="https://omdia.tech.informa.com/authors/lian-jye-su" target="_blank" rel="noreferrer noopener">Lian Jye Su</a>, chief analyst at Omdia. “First, they look at overall performance against competitors, where GLM-5.2 performs well in long-horizon agentic coding and software engineering. Second, they look at the cost of adoption. As an open-source model, GLM-5.2 has clear cost advantages.”</p>
<p>Su said the model could appeal to engineering teams under pressure to control AI costs. It may also attract open-source advocates and companies with significant operations in Asia-Pacific.</p>
<p>But the claims still need wider validation, particularly around hallucination control and coherence during extended tasks. These are critical issues for enterprises considering AI coding agents, which may need to work across large codebases and multi-step software engineering workflows.</p>
<p>Jain said the one million-token context window could be useful for large codebase analysis. It could also help with legacy modernization projects and complex engineering documentation.</p>
<p>He said long-context capability may also help with audit logs or legal contracts, where splitting material into smaller chunks can create errors across document boundaries. But for everyday coding tasks, effective retrieval systems may matter more than very large context windows, making some of the benefits more limited in practice.</p>
<h2 class="wp-block-heading" id="governance-risks">Governance risks</h2>
<p>The governance question depends largely on where the model runs.</p>
<p>Sheel said enterprises should evaluate GLM-5.2 as they would any strategic technology partner, rather than as a standalone model. That means looking at where data is stored and whether the model can be used in environments customers control.</p>
<p>That deployment choice is central to the risk calculation, according to Jain. Because GLM-5.2 is available under an MIT license, companies can download the weights and run them on their own infrastructure, reducing the need to send sensitive data to Z.ai.</p>
<p>“The risk flips completely if you use Z.ai’s hosted API instead,” Jain said.</p>
<p>He said Chinese national security rules could require domestic companies to cooperate with government requests, making hosted use difficult for regulated industries or workloads involving sensitive data.</p>
<p>Su said the issue is not limited to Chinese vendors. Recent restrictions affecting access to some <a href="https://www.computerworld.com/article/4185515/anthropics-new-privacy-policy-offers-us-consumers-a-way-around-fable-ban-2.html">Anthropic models</a> have also highlighted the risk that enterprises may have limited control over the availability of AI services from foreign providers.</p>
<p>“Selecting solutions from American and Chinese AI vendors does expose non-US Western enterprises to additional risk of having zero control over the availability and uptime of these models,” Su said.</p>
</div>
</div>
</div>
</div>
</div><p>The post <a href="https://www.azalio.io/z-ai-pitches-glm-5-2-for-long-running-software-engineering-tasks/">Z.ai pitches GLM-5.2 for long-running software engineering tasks</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>From RAG to ontology: Databricks bets on context as the key to trusted AI agents</title>
		<link>https://www.azalio.io/from-rag-to-ontology-databricks-bets-on-context-as-the-key-to-trusted-ai-agents/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 10:59:23 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/from-rag-to-ontology-databricks-bets-on-context-as-the-key-to-trusted-ai-agents/</guid>

					<description><![CDATA[<p>First came vector databases, then RAG. Now, the next frontier in enterprise AI is taking shape: context layers that give autonomous agents a shared understanding of the business, a vision Databricks is advancing with Genie Ontology. Currently in preview, Genie Ontology automatically extracts business context from enterprise data, dashboards, queries, pipelines, documents, and applications and [&#8230;]</p>
<p>The post <a href="https://www.azalio.io/from-rag-to-ontology-databricks-bets-on-context-as-the-key-to-trusted-ai-agents/">From RAG to ontology: Databricks bets on context as the key to trusted AI agents</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>
<div id="remove_no_follow">
<div class="grid grid--cols-10@md grid--cols-8@lg article-column">
<div class="col-12 col-10@md col-6@lg col-start-3@lg">
<div class="article-column__content">
<section class="wp-block-bigbite-multi-title">
<div class="container"></div>
</section>
<p>First came vector databases, then RAG. Now, the next frontier in enterprise AI is taking shape: context layers that give autonomous agents a shared understanding of the business, a vision Databricks is advancing with Genie Ontology.</p>
<p>Currently in preview, Genie Ontology automatically extracts business context from enterprise data, dashboards, queries, pipelines, documents, and applications and organizes it into a living graph that AI agents can use to understand how an organization operates.</p>
<p>Showcased at the company’s Data + AI Summit, Genie Ontology uses a ranking system inspired by <a href="http://infolab.stanford.edu/~backrub/google.html" target="_blank" rel="noreferrer noopener">Google’s PageRank</a> to identify the most authoritative business definitions within an organization.</p>
<p>Rather than treating all sources equally, it weighs factors including who created the information, how widely it is used, its links to certified datasets and assets, and how recently it was updated before determining which answer an AI agent should rely on, Databricks CEO <a href="https://www.linkedin.com/in/alighodsi/" target="_blank" rel="noreferrer noopener">Ali Ghodsi</a> said during his keynote late on Tuesday while explaining the new offering.</p>
<p>Organizations can also upload their own business definitions or ontologies to Genie Ontology via Databricks’ existing Unity Catalog Semantics platform, Ghodsi added.</p>
<h2 class="wp-block-heading" id="ontology-promises-consistency-but-readiness-remains-a-hurdle">Ontology promises consistency, but readiness remains a hurdle</h2>
<p>For CIOs, a unified context layer, such as Genie Ontology, will materially improve consistency, trust, and governance for enterprise AI deployments, according to analysts.</p>
<p>“One definition feeding every agent means you stop getting three different answers to the same question,” said <a href="https://moorinsightsstrategy.com/team/mike-leone/" target="_blank" rel="noreferrer noopener">Michael Leone</a>, principal analyst at Moor Insights and Strategy.</p>
<p>“Older approaches, such as RAG and vector search, just pull back whatever looks similar to your question, and they don’t actually understand your business. An ontology gives the agent the meaning a catalog can’t, what your terms mean, and which source to trust,” Leone added.</p>
<p>That improvement in consistency, according to <a href="https://www.hfsresearch.com/team/ashish-chaturvedi/">Ashish Chaturvedi</a>, leader of executive research at HFS Research, could also improve trust, which remains one of the most critical barriers to AI adoption.</p>
<p>“The single biggest barrier to enterprise AI adoption is that decision-makers don’t trust AI outputs enough to act on them without checking. An ontology that grounds answers in governed business definitions, with lineage back to source, directly attacks that trust deficit,” Chaturvedi said.</p>
<p>Alternatively, Leone was more cautious about the trust argument: “It’s a promising idea, but it still has to prove itself before I’d lean on it for anything that matters.”</p>
<p>Echoing Leone, HyperFRAME Research’s practice leader of AI stack <a href="https://www.linkedin.com/m/in/slwalter">Stephanie Walter</a> pointed out that ontologies have a missing link, and that is verification: “Ontologies can improve context, but they do not guarantee the answer is correct. An agent can still pull incomplete data, apply the wrong logic, skip rows, misunderstand a workflow, or take the wrong action.”</p>
<p>That verification gap becomes even more critical, according to Leone, because most enterprises don’t have the data and governance readiness required to implement an ontology layer for AI deployments: “If your data and governance aren’t already in order, this just speeds up your existing mess.”</p>
<p>Seconding Leone, Walter pointed out that an ontology cannot fix messy definitions, poor lineage, weak ownership, or fragmented permissions on its own.</p>
<p>Additionally, the analyst pointed out that the hard part for CIOs is not creating an ontology once but keeping it accurate as the business changes: “Enterprises will need clear data ownership, metric ownership, domain expertise, governance processes, and a way to resolve conflicting definitions.”</p>
<p>“Otherwise, the ontology becomes another stale metadata project with a more sophisticated name,” Walter added.</p>
<h2 class="wp-block-heading" id="a-growing-risk-of-cio-confusion">A growing risk of CIO confusion</h2>
<p>Beyond data and governance readiness, CIOs also face a growing risk of confusion in the wake of several technology vendors pursuing approaches, similar to Genie Ontology, to ground enterprise AI in a business context, according to analysts.</p>
<p>Over the past year, Snowflake, Microsoft, and others have introduced some form of ontology, semantic, and context-layer offerings, but the problem is in how these offerings are named, Leone said.</p>
<p>“Everyone slapped a different name on basically the same idea. It slows people down as it creates confusion,” Leone noted.</p>
<p>That confusion could also backfire on Databricks and other vendors, according to <a href="https://www.linkedin.com/in/bhupendrachopra/" target="_blank" rel="noreferrer noopener">Bhupendra Chopra</a>, cofounder and CRO of IT consulting firm Kanerika: “While the marketing has converged around context-building offerings, most enterprises will choose the platform where their data already resides.”</p>
<p>HFS Research’s Chaturvedi doubled down on that view, saying CIOs should resist evaluating ontology offerings in isolation and asked them to stick to the mantra of context layer follows data gravity: “If your data lives in Databricks, Genie Ontology is your path. If it’s in Snowflake, <a href="https://www.cio.com/article/4180170/snowflakes-horizon-context-aims-to-give-ai-agents-a-common-understanding-of-the-business.html">Horizon Context</a> is. If you’re a Microsoft shop, the <a href="https://www.infoworld.com/article/4093181/microsoft-fabric-iq-adds-semantic-intelligence-layer-to-fabric.html">IQ</a> family is.”</p>
<p>Additionally, Chaturvedi urged CIOs to look beyond functionality and assess how open and portable these offerings are, particularly in multi-platform environments where business definitions may need to move across data <a href="https://www.infoworld.com/article/2334907/review-databricks-lakehouse-platform.html">lakehouses</a>, analytics tools, and AI platforms.</p>
<p>This is where Chaturvedi sees Snowflake differentiating itself from rivals, with its focus on open semantic interoperability aimed at reducing the risk of semantic lock-in as enterprises evolve their data and analytics stacks.</p>
<h2 class="wp-block-heading" id="the-battle-for-the-ai-control-plane">The battle for the AI control plane</h2>
<p>Snowflake’s efforts to differentiate itself, though, analysts pointed out, at least for CIOs, draw attention to a larger race among vendors, including Databricks, to become the control plane for enterprise AI.</p>
<p>While Snowflake is attempting to position itself as an AI control layer through a combination of <a href="https://www.infoworld.com/article/3603375/snowflake-bares-its-agentic-ai-plans-by-showcasing-its-intelligence-platform.html">Snowflake Intelligence</a>, Horizon Catalog, and its push for open semantic interoperability, Microsoft is embedding business context and governance across its Copilot, Fabric, and broader AI stack through offerings such as Work IQ, Fabric IQ, and Foundry IQ, Chaturvedi said.</p>
<p>Databricks’ Genie Ontology, too, is part of a similar strategy, Chaturvedi pointed out, urging CIOs to view the offering in the context of the company’s wider effort to position its lakehouse platform as the foundation on which enterprise AI agents are built, governed, and eventually deployed.</p>
<p>“It’s absolutely a control-plane play. When you connect the dots across everything Databricks has announced at this summit, including <a href="https://www.infoworld.com/article/4185622/databricks-pitches-ltap-as-a-new-foundation-for-agentic-applications.html">LTAP</a>, <a href="https://www.infoworld.com/article/4184076/databricks-opensharing-targets-the-integration-tax-of-enterprise-ai.html">OpenSharing</a>, and Genie Ontology, you see a single place where enterprise data, governance, business semantics, and agent execution all converge,” Chaturvedi added.</p>
<p>Further, the analyst noted that the control-plane strategy reflects Ghodsi’s broader vision that data platforms could evolve into what the CEO describes as an “agentic system of record” — an authoritative source that AI agents read from, reason over, and act through.</p>
<p>The concept mirrors earlier platform shifts, Chaturvedi said, where ERP systems became the system of record for business transactions and data warehouses became the system of record for analytics.</p>
<p>The next battle, the analyst said, is over which platform becomes the system of record for enterprise AI agents.</p>
<p>Moor Insights and Strategy’s Leone agreed that data platforms are well-positioned to compete for that role because they already own the data, governance controls, lineage, and permissions that agents require to operate safely at scale.</p>
<p>Still, analysts cautioned that context alone will not determine which vendor comes out on top.</p>
<p>“The next enterprise AI battleground is not just context. It is verifiable execution,” Walter said.</p>
</div>
</div>
</div>
</div>
</div><p>The post <a href="https://www.azalio.io/from-rag-to-ontology-databricks-bets-on-context-as-the-key-to-trusted-ai-agents/">From RAG to ontology: Databricks bets on context as the key to trusted AI agents</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Designing frontend systems for cloud latency, not just cloud failure</title>
		<link>https://www.azalio.io/designing-frontend-systems-for-cloud-latency-not-just-cloud-failure/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 09:59:35 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/designing-frontend-systems-for-cloud-latency-not-just-cloud-failure/</guid>

					<description><![CDATA[<p>Frontend reliability is often discussed in terms of outages. Teams prepare for failed API calls, downtime and visible crashes because those failures are easy to recognize and measure. However, in many modern applications, the bigger challenge is not complete failure but latency. Systems rarely go fully offline. Instead, they become slow enough that users lose [&#8230;]</p>
<p>The post <a href="https://www.azalio.io/designing-frontend-systems-for-cloud-latency-not-just-cloud-failure/">Designing frontend systems for cloud latency, not just cloud failure</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>
<div id="remove_no_follow">
<div class="grid grid--cols-10@md grid--cols-8@lg article-column">
<div class="col-12 col-10@md col-6@lg col-start-3@lg">
<div class="article-column__content">
<section class="wp-block-bigbite-multi-title">
<div class="container"></div>
</section>
<p>Frontend reliability is often discussed in terms of outages. Teams prepare for failed API calls, downtime and visible crashes because those failures are easy to recognize and measure. However, in many modern applications, the bigger challenge is not complete failure but latency. Systems rarely go fully offline. Instead, they become slow enough that users lose confidence in the interface long before anything technically breaks.</p>
<p>Most frontend engineers have experienced this in production. A page eventually loads, but only after several seconds of waiting. A save action succeeds in the backend, yet the interface remains unchanged long enough that the user clicks the button again. A dashboard renders immediately, but the critical data appears so late that the application feels unstable. In practice, users rarely distinguish between “slow” and “broken.” If an interaction feels uncertain or delayed, trust drops quickly.</p>
<p>As frontend systems become increasingly dependent on distributed cloud infrastructure, latency becomes a normal operating condition rather than an occasional exception. APIs may depend on multiple downstream services, serverless systems may introduce startup delays and state updates may propagate asynchronously across regions or caches. Frontend reliability therefore can no longer be defined only by uptime. It also depends on how clearly the interface behaves while waiting on slow cloud dependencies.</p>
<h2 class="wp-block-heading" id="reliability-is-more-than-preventing-outages">Reliability is more than preventing outages</h2>
<p><a href="https://www.infoworld.com/article/4167398/designing-front-end-systems-for-cloud-failure.html">Frontend teams traditionally define reliability around visible failure</a>. A stable application is one that avoids blank screens, uncaught exceptions and broken API requests. While those issues still matter, they represent only one category of reliability problems.</p>
<p>In many production systems, the application remains technically available while still feeling unreliable to users. An API request that takes ten seconds may still return successfully. A delayed state update may eventually synchronize correctly. A dashboard may fully render after several refresh cycles. From a monitoring perspective, these systems appear healthy. From a user perspective, however, the experience feels inconsistent and unpredictable.</p>
<p>What makes this difficult is that many frontend applications are still designed around ideal conditions. Interfaces often assume fast responses, immediate consistency and stable network behavior. Cloud environments rarely behave with that level of predictability. Distributed systems naturally introduce latency, asynchronous updates and intermittent delays, even when everything is technically functioning correctly.</p>
<p>This changes how frontend engineers need to think about resilience. Modern interfaces are no longer designed only for success and failure. They also need to handle slow success, delayed feedback and partial availability in ways that remain understandable to users.</p>
<h2 class="wp-block-heading" id="slow-apis-and-the-user-experience-problem">Slow APIs and the user experience problem</h2>
<p>One of the most common latency issues frontend teams encounter is the slow API response. Technically, the request succeeds, but the delay changes how users interpret the application. A user waiting several seconds for a response rarely thinks about infrastructure behavior or distributed systems. They assume the interface is frozen, unresponsive or broken.</p>
<p>This becomes especially noticeable in cloud-connected systems where response times vary depending on caching layers, regional routing, serverless initialization or downstream dependencies. Frontend applications designed around consistently fast responses often struggle in these conditions because they treat latency as exceptional instead of expected.</p>
<p>Many applications still rely on generic loading spinners that provide little context beyond “something is happening.” While spinners technically indicate activity, they also communicate uncertainty. Users do not know whether the system is progressing, stalled or close to failure. As waiting time increases, frustration grows quickly.</p>
<p>More effective interfaces treat loading as part of the user experience rather than a temporary placeholder. Skeleton screens, layout-preserving placeholders and contextual loading feedback provide users with a clearer understanding of what is happening. Even relatively small decisions can make a noticeable difference. For example, users generally tolerate a dashboard chart loading later far more than they tolerate waiting several seconds before the navigation menu appears. Similarly, an e-commerce page that progressively loads recommendations often feels faster than one that blocks the entire interface until every service responds.</p>
<p>These patterns do not reduce actual latency, but they significantly improve perceived responsiveness. When users understand that the system is actively processing their request, they are far more tolerant of delays.</p>
<h2 class="wp-block-heading" id="delayed-state-updates-and-perceived-instability">Delayed state updates and perceived instability</h2>
<p>Another common issue in cloud-based systems is delayed state synchronization. A user submits an action successfully, but the interface does not immediately reflect the result. Even if the backend processes the request correctly, the delay creates uncertainty around whether the action worked at all.</p>
<p>This behavior is increasingly common in distributed architectures where writes may propagate asynchronously across services, regions or caches. Frontend applications that assume immediate consistency often expose these delays directly to users, leading to repeated clicks, duplicate submissions and confusion around application state.</p>
<p>In many cases, the real problem is not the delay itself but the absence of clear feedback. Users are generally willing to wait if the interface acknowledges that progress is occurring. Without feedback, even short delays begin to feel unreliable.</p>
<p>Frontend systems can reduce this uncertainty by designing around transitional states more intentionally. Pending confirmations, temporary UI updates and clearer progress indicators help maintain confidence while backend systems synchronize. Users care less about exact infrastructure timing than about whether the interface feels responsive and understandable throughout the interaction.</p>
<h2 class="wp-block-heading" id="progressive-rendering-and-partial-availability">Progressive rendering and partial availability</h2>
<p>Traditional frontend rendering strategies often wait for all required data before displaying meaningful content. While this simplifies state management, it also magnifies perceived latency by forcing users to stare at incomplete or empty screens.</p>
<p>Modern cloud systems rarely deliver all information simultaneously. Some requests complete quickly while others depend on slower downstream services. Treating the entire interface as blocked until every dependency responds unnecessarily increases the feeling of slowness.</p>
<p>Progressive rendering offers a more resilient alternative. Instead of waiting for the entire page, the application renders available content immediately while secondary information loads incrementally. A profile page may display account details before loading recommendations. A dashboard may render navigation and layout structure first while analytics populate later. This creates a sense of continuous progress rather than complete inactivity.</p>
<p>What makes this approach effective is that users tolerate delayed secondary content far better than delayed primary interaction. Even when total loading time remains unchanged, the interface feels significantly more stable because progress is visible.</p>
<p>Progressive rendering also changes how frontend teams think about dependency importance. Not every cloud service needs to block the entire user experience. Separating critical interaction paths from secondary content creates systems that remain usable even under imperfect network conditions.</p>
<h2 class="wp-block-heading" id="designing-recovery-not-just-error-messages">Designing recovery, not just error messages</h2>
<p>Latency-related problems are often made worse by weak recovery patterns. Many applications either wait indefinitely or fail abruptly with vague messaging such as “Something went wrong.” Neither approach creates confidence for users who are already uncertain about system behavior.</p>
<p>Frontend resilience improves significantly when interfaces guide users through delays and recovery intentionally. Instead of displaying generic failures immediately, applications can acknowledge that an operation is taking longer than expected while still preserving user progress. Retry actions, recoverable workflows and state preservation help users continue without restarting tasks entirely.</p>
<p>This becomes especially important in forms and transactional flows. Losing user input because of a temporary timeout often damages trust more than the delay itself. In practice, people are usually willing to retry an action. They are far less willing to repeat work they already completed once.</p>
<p>These recovery patterns are not simply UX enhancements. They are architectural decisions that recognize cloud latency as a normal operating condition rather than an edge case.</p>
<h2 class="wp-block-heading" id="designing-for-latency-is-designing-for-reality">Designing for latency is designing for reality</h2>
<p>As frontend systems become more dependent on distributed cloud infrastructure, latency becomes unavoidable. The most reliable interfaces are not necessarily the ones with the fastest response times. They are the ones that communicate clearly, preserve user confidence and remain usable during delays.</p>
<p>Frontend teams spend enormous effort optimizing rendering performance while often underestimating the impact of latency behavior on user trust. In many applications, the difference between a reliable experience and a frustrating one is not milliseconds of rendering speed but how clearly the interface handles waiting.</p>
<p>Frontend engineers do not need to become cloud infrastructure specialists to design for these realities. However, understanding that cloud latency is a normal part of modern application behavior fundamentally changes how interfaces should be built. Designing for latency is ultimately not about preparing for rare outages. It is about designing for the way modern systems behave every day.</p>
<p><strong>This article is published as part of the Foundry Expert Contributor Network.</strong><br /><strong><a href="https://www.infoworld.com/expert-contributor-network/">Want to join?</a></strong></p>
</div>
</div>
</div>
</div>
</div><p>The post <a href="https://www.azalio.io/designing-frontend-systems-for-cloud-latency-not-just-cloud-failure/">Designing frontend systems for cloud latency, not just cloud failure</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>10 tips for getting better R code from your AI coding agent</title>
		<link>https://www.azalio.io/10-tips-for-getting-better-r-code-from-your-ai-coding-agent/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 09:59:35 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/10-tips-for-getting-better-r-code-from-your-ai-coding-agent/</guid>

					<description><![CDATA[<p>Most generative AI tools know less about R than languages like JavaScript and Python, thanks to how much training data is available for each. However, with a little extra setup, you can give a large language model (LLM) the knowledge it needs to improve its R skills. Here are 10 ways to help generative AI [&#8230;]</p>
<p>The post <a href="https://www.azalio.io/10-tips-for-getting-better-r-code-from-your-ai-coding-agent/">10 tips for getting better R code from your AI coding agent</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>
<div id="remove_no_follow">
<div class="grid grid--cols-10@md grid--cols-8@lg article-column">
<div class="col-12 col-10@md col-6@lg col-start-3@lg">
<div class="article-column__content">
<section class="wp-block-bigbite-multi-title">
<div class="container"></div>
</section>
<p>Most <a href="https://www.infoworld.com/article/2338115/what-is-generative-ai-artificial-intelligence-that-creates.html" data-type="link" data-id="https://www.infoworld.com/article/2338115/what-is-generative-ai-artificial-intelligence-that-creates.html">generative AI</a> tools know less about R than languages like <a href="https://www.infoworld.com/article/2263137/what-is-javascript-the-full-stack-programming-language.html" data-type="link" data-id="https://www.infoworld.com/article/2263137/what-is-javascript-the-full-stack-programming-language.html">JavaScript</a> and <a href="https://www.infoworld.com/article/2253770/what-is-python-powerful-intuitive-programming.html" data-type="link" data-id="https://www.infoworld.com/article/2253770/what-is-python-powerful-intuitive-programming.html">Python</a>, thanks to how much training data is available for each. However, with a little extra setup, you can give a <a href="https://www.infoworld.com/article/2335213/large-language-models-the-foundations-of-generative-ai.html" data-type="link" data-id="https://www.infoworld.com/article/2335213/large-language-models-the-foundations-of-generative-ai.html">large language model</a> (LLM) the knowledge it needs to improve its R skills.</p>
<p>Here are 10 ways to help generative AI write R code like a pro.</p>
<h2 class="wp-block-heading" id="use-a-coding-agent">Use a coding agent </h2>
<p>AI coding agents have more power, flexibility, and coding-focused tools than general-purpose chatbots.</p>
<p>Anthropic’s Claude Code and OpenAI’s Codex agents have versions that run in a terminal, IDE extensions, desktop and mobile apps, and other integrations.</p>
<p>R users may also be interested in Posit’s <a href="https://posit-dev.github.io/assistant/" data-type="link" data-id="https://posit-dev.github.io/assistant/">Posit Assistant</a>, which is designed for data analysis in both R and Python. It needs less setup for R than general-purpose coding agents, and it has more built-in knowledge about data science, R package development, and Shiny apps. Plus, it can read objects in your R and Python environments by default, which can be useful in some situations (although perhaps not if you’re working with sensitive data). There are versions of Posit Assistant for RStudio, the Positron IDE, and the terminal. Setup info is <a href="https://posit-dev.github.io/assistant/docs/downloads/positron/" data-type="link" data-id="https://posit-dev.github.io/assistant/docs/downloads/positron/">here</a>.</p>
<p>Note: Posit Assistant is <em>not</em> the same as the older Posit<em>ron</em> Assistant that was demo’d at last year’s posit::conf(). The older assistant will eventually be superceded by Posit Assistant. Posit Assistant comes pre-installed with RStudio but not pre-activated, so you don’t need to use it unless you want to.</p>
<p>Google’s original entry in this space, Gemini CLI, is being retired in favor of a new “Antigravity CLI” <a href="https://developers.googleblog.com/an-important-update-transitioning-gemini-cli-to-antigravity-cli/" data-type="link" data-id="https://developers.googleblog.com/an-important-update-transitioning-gemini-cli-to-antigravity-cli/">starting June 18</a>. </p>
<p>There are other IDE platforms with built-in agents, including Cursor and Windsurf, and terminal + subscription coding agents such as Warp, which was <a href="https://www.warp.dev/blog/warp-is-now-open-source" data-type="link" data-id="https://www.warp.dev/blog/warp-is-now-open-source">open-sourced in April</a>. However, I won’t be covering those here.</p>
<p>You may be limited at work by what your employer allows you to use, but you can still experiment on your own with public data — and then lobby for a new agent if a different one better meets your needs. Whatever you choose, though, moving to one of these from a web-based chatbot should produce better code. </p>
<p>“The shift from chatbot to agent is the most important change in how people use AI since ChatGPT launched,” <a href="https://www.oneusefulthing.org/p/a-guide-to-which-ai-to-use-in-the">says Ethan Mollick</a>, co-director of the Wharton School’s Generative AI Labs at the University of Pennsylvania. </p>
<h2 class="wp-block-heading" id="set-up-claude-md-agents-md-or-gemini-md-knowledge-files">Set up CLAUDE.md, AGENTS.md, or GEMINI.md knowledge files</h2>
<p>All the major coding agents look for knowledge files to load each time you start a session, and you can easily edit those files. The files can include things like your proficiency in various languages and how you like to document projects. For example, I tell my LLMs that I’d like a README.md in every project with info on how to use the project and technical info on how the project was coded.</p>
<p>Most coding tools let you have both an overarching main file that’s loaded into every session, and an additional file per project.</p>
<p>If you’re adding Claude or Codex to an existing code base, the <code>/init</code> slash command will create one of those .md project files after scanning and analyzing the code. You can also ask an LLM to interview you to create or add to one of those files.</p>
<p>Claude Code looks for a user-level main CLAUDE.md file at ~/.claude/CLAUDE.md and then a project-level file at ./CLAUDE.md or ./.claude/CLAUDE.md. The <code>/memory</code> slash command shows where those files are located. Codex uses <a href="https://developers.openai.com/codex/guides/agents-md" data-type="link" data-id="https://developers.openai.com/codex/guides/agents-md">AGENTS.md files</a>. Antigravity docs say it looks for a main GEMINI.md file plus GEMINI.md and/or AGENTS.md files in your working directory.</p>
<p>Positron Assistant is set up to read AGENT.md, AGENTS.md, POSITRON.md, CLAUDE.md, GEMINI.md, and LLMS.txt in a workspace root directory, according to the docs. See the <a href="https://positron.posit.co/assistant-chat-instructions.html" data-type="link" data-id="https://positron.posit.co/assistant-chat-instructions.html">Positron docs</a> for more on custom instructions.</p>
<h2 class="wp-block-heading" id="use-agent-skills">Use agent skills</h2>
<p>Agent skills can turn an LLM with questionable R knowledge into an expert that understand how you want it to write code. You can develop your own skills, work with your LLM to create skills, or download skills written by others. Claude even has a skill to help you build skills. This is one of the best ways to stop having to repeat the same instructions over and over. Instead, the LLM will “remember” what’s in your skills files by loading them when they’re needed.</p>
<p>What’s the difference between the knowledge .md files mentioned above and a skill? I like to think of CLAUDE.md or AGENTS.md files as <em>important background info that’s always available</em>, while skills are <em>more specific instructions that are useful at certain times.</em> Or, as Claude Opus 4.7 put it: “Skills are for triggered workflows (‘when user asks X, do Y’). A standing rule that applies to every project isn’t trigger-based; it’s a directive I should always follow.” Standing rules should go in CLAUDE.md.</p>
<p>“Create a skill when you keep pasting the same instructions, checklist, or multi-step procedure into chat, or when a section of CLAUDE.md has grown into a procedure rather than a fact,” according to Anthropic’s <a href="https://code.claude.com/docs/en/skills" data-type="link" data-id="https://code.claude.com/docs/en/skills">Claude Code docs</a>. “Unlike CLAUDE.md content, a skill’s body loads only when it’s used, so long reference material costs almost nothing until you need it.” Only skill <em>descriptions</em> are loaded at the start of each session, so the LLM knows what skills are available.</p>
<p>Anthropic originally created skills for Claude <a href="https://www.anthropic.com/engineering/equipping-agents-for-the-real-world-with-agent-skills" data-type="link" data-id="https://www.anthropic.com/engineering/equipping-agents-for-the-real-world-with-agent-skills">in October 2025</a> and then published the concept as <a href="https://agentskills.io/home" data-type="link" data-id="https://agentskills.io/home">an open standard</a> a couple of months later. Most coding agents now use them.</p>
<p>Anthropic has an easy-to-folllow guide to creating custom skills <a href="https://support.claude.com/en/articles/12512198-how-to-create-custom-skills" data-type="link" data-id="https://support.claude.com/en/articles/12512198-how-to-create-custom-skills">here</a> if you want to try your hand at building your own. </p>
<p>Posit Assistant comes with several R-related skills built in and allows users to add more. You’ll find them <a href="https://posit-dev.github.io/assistant/docs/features/skills/" data-type="link" data-id="https://posit-dev.github.io/assistant/docs/features/skills/">here</a>.</p>
<p>If you’re not using Posit Assistant, you can add Posit-developed skills to another coding agent. Posit published a GitHub repository with skills for creating R packages, Shiny apps, and Quarto documents; for creating and resolving GitHub pull requests; for open-source R and Python package releases; for code reviews (R, Python, JavaScript/TypeScript, SQL); and for creating architectural documentation for a codebase. Browse them all <a href="https://github.com/posit-dev/skills" data-type="link" data-id="https://github.com/posit-dev/skills">here</a>. </p>
<h2 class="wp-block-heading" id="read-and-edit-the-skills-you-download">Read and edit the skills you download</h2>
<p>Skills are just a folder with one or more markdown text files that have a structured YAML header. That folder can include optional resources and scripts, too. Skill instruction files are easy to read and edit, so you can make sure they do what you want. Tweak them as desired.</p>
<p>For example, Claude helped me write my own customized R skill. Claude leans toward tidyverse packages, but there are other packages I like a lot, too, such data.table. I made sure the R skill knows my package preferences, and under what circumstances I prefer each.</p>
<p>If one of your preferred procedures changes, remember to update the associated skills. Or, ask an LLM to update it for you. Keeping skills up to date will make your coding agent more useful.</p>
<p>Keep your CLAUDE.md or AGENTS.md files updated, too. Maybe you started off with a CLAUDE.md file that said you’re an experienced Python programmer who is just learning R. Three months from now, you might want to update your R skill level. Or, perhaps you told an AGENTS.md file that you like thoroughly documented code, but now you’d prefer a lighter touch. You may want to have your coding agent go over those files with you from time to time and ask you about the main points and whether they need updating.</p>
<h2 class="wp-block-heading" id="use-the-btw-r-package-and-its-mcp-server">Use the btw R package and its MCP server </h2>
<p>It’s frustrating when an LLM writes code based on an ancient R package version in its training data, or when it doesn’t know about a relatively recent package at all. <a href="https://posit-dev.github.io/btw/" data-type="link" data-id="https://posit-dev.github.io/btw/">btw</a> solves this problem by letting your coding agent access info about all the R packages <em>installed on your system.</em> That means it can write code based on your specific R environment. Plus, btw lets an LLM access variables in your current R environment via a <a href="https://www.infoworld.com/article/4029634/what-is-model-context-protocol-how-mcp-bridges-ai-and-external-services.html" data-type="link" data-id="https://www.infoworld.com/article/4029634/what-is-model-context-protocol-how-mcp-bridges-ai-and-external-services.html">Model Context Protocol</a> (MCP) server.</p>
<p>MCP servers are a standardized way for LLMs to access external data — in this case, your running R session (external doesn’t have to mean cloud). Like skills, MCP was created by Anthropic as an open standard and has since been adopted by most major AI platforms. The <a href="https://posit-dev.github.io/btw/" data-type="link" data-id="https://posit-dev.github.io/btw/">btw R package</a> includes an MCP server, which you can install for Claude Code by running the following code in a terminal window (not R console).</p>
<pre class="wp-block-code"><code>claude mcp add -s "user" r-btw -- Rscript -e "btw::btw_mcp_server()"
</code></pre>
<p>btw comes with a <em>lot</em> of tools. It’s not a great idea to overload your LLM with tools it may never use. If you just want to register btw tools for accessing your R session and looking up package documentation, you can run a command like this instead:</p>
<pre class="wp-block-code"><code>claude mcp add -s "user" r-btw -- Rscript -e "btw::btw_mcp_server(tools = btw::btw_tools('btw_tool_run_r', 'docs', 'env'))"
</code></pre>
<p>You only need to do this once.</p>
<p>To connect Claude Code to your open R session, you also need to run <code>btw::btw_mcp_session()</code> <em>in every new R session.</em> (I sometimes forget this part until I’m puzzled why Claude can’t read my R variables or even see what packages I have installed.)</p>
<p>You can see a demo of btw in <a href="https://youtu.be/7GI6-4J0AXA?si=gUeERaXsnh-IZsYC&amp;t=176" data-type="link" data-id="https://youtu.be/7GI6-4J0AXA?si=gUeERaXsnh-IZsYC&amp;t=176">this Posit video</a> that compares Claude Code and Posit Assistant. </p>
<div class="extendedBlock-wrapper block-coreImage undefined">
<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" src="https://b2b-contenthub.com/wp-content/uploads/2026/06/test_claude_code_btw.jpg?quality=50&amp;strip=all" alt="test_claude_code_btw" class="wp-image-4184658" width="942" height="511" sizes="auto, (max-width: 942px) 100vw, 942px"><figcaption class="wp-element-caption">
<p>The btw R package lets Claude Code and other agents read objects in your R session.</p>
</figcaption></figure>
<p class="imageCredit">Foundry</p>
</div>
<p>You don’t need to do this setup for Posit Assistant, since tools for accessing your R session and variables are built in.</p>
<h2 class="wp-block-heading" id="use-plan-mode-for-your-projects">Use plan mode for your projects</h2>
<p>I cannot stress this enough. Just as humans benefit from having a plan before starting to implement a project, LLMs benefit from working through a plan before beginning to generate code. Plus, it can be useful at times to “brainstorm” with an LLM about structures and features, ask it for alternatives, or even have another LLM review a plan before implementing it.</p>
<p>Anthropic advises starting with plan mode. “Letting Claude jump straight to coding can produce code that solves the wrong problem. Use plan mode to separate exploration from execution,” they wrote in a <a href="https://code.claude.com/docs/en/best-practices" data-type="link" data-id="https://code.claude.com/docs/en/best-practices">best practices doc</a>. </p>
<p>Claude, Codex, Gemini, and Posit Assistant all have a plan mode in their CLI versions that can be activated with the <code>/plan</code> slash command.</p>
<div class="extendedBlock-wrapper block-coreImage undefined">
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" src="https://b2b-contenthub.com/wp-content/uploads/2026/06/claude_plan1.jpg?quality=50&amp;strip=all&amp;w=1024" alt="claude_plan1" class="wp-image-4184722" width="1024" height="399" sizes="auto, (max-width: 1024px) 100vw, 1024px"><figcaption class="wp-element-caption">
<p></p>
</figcaption></figure>
<p class="imageCredit">Foundry</p>
</div>
<div class="extendedBlock-wrapper block-coreImage undefined">
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" src="https://b2b-contenthub.com/wp-content/uploads/2026/06/claude_plan2_8e2bf0.jpg?quality=50&amp;strip=all&amp;w=1024" alt="claude_plan2 rev" class="wp-image-4184724" width="1024" height="414" sizes="auto, (max-width: 1024px) 100vw, 1024px"><figcaption class="wp-element-caption">
<p>When in plan mode activated with /plan, Claude will often ask questions before coming up with its plan.</p>
</figcaption></figure>
<p class="imageCredit">Foundry</p>
</div>
<h2 class="wp-block-heading" id="make-sure-your-coding-agent-learns-from-its-mistakes">Make sure your coding agent learns from its mistakes </h2>
<p>This is one of the best tips I read first from Joe Amditis, associate director of operations at the Center for Cooperative Media at Montclair State University: If your coding agent makes a mistake and you work together to fix it, make sure to save that memory so it doesn’t make the same error again.</p>
<p>Your agent may have an auto memory system that’s supposed to do some of this (I’ve used Claude Code’s), but you can instruct any coding agent to keep a separate lessons learned file for each project, too, or ask it what’s the best way to ensure it doesn’t make the same error again. This isn’t a guarantee, any more than you can be assured an LLM will follow all the rules in a CLAUDE.md or AGENTS.md file or all the steps in a skill. But it has definitely cut down on my agents making the same errors over and over.</p>
<h2 class="wp-block-heading" id="have-your-agent-write-tests-and-do-code-reviews">Have your agent write tests and do code reviews </h2>
<p>Tests and code reviews done by a coding agent aren’t a substitute for a human checking that software works. Review an LLM’s results yourself any time a task is important! However, agent-generated tests are still a good way to head off some problems and cut down the time you spend as your LLM’s debugging partner. If you don’t know where to start to help your agent know how to test and review code, see Posit’s <a href="https://github.com/posit-dev/skills/tree/main/r-lib/testing-r-packages" data-type="link" data-id="https://github.com/posit-dev/skills/tree/main/r-lib/testing-r-packages">testing-r-packages</a> and <a href="https://github.com/posit-dev/skills/tree/main/posit-dev/critical-code-reviewer" data-type="link" data-id="https://github.com/posit-dev/skills/tree/main/posit-dev/critical-code-reviewer">critical-code-reviewer</a> skills. Your agent may also have built-in code review skills, such as Claude Code’s <code>/code-review</code> slash command.</p>
<p>Code review skills are available from many sources. For example, Sentry has a code review skill in its repository. Aimed specifically at its project’s needs, it can be a useful sample for similar skills. And <a href="https://github.com/obra/superpowers" data-type="link" data-id="https://github.com/obra/superpowers">superpowers</a> is a popular set of general-purpose programming skills that’s billed as “a complete software development methodology for your coding agents, built on top of a set of composable skills and some initial instructions that make sure your agent uses them.” I don’t use superpowers, but a lot of other people clearly do given its more than 227,000 GitHub stars and 20,000 forks.</p>
<p>Another tip: Several experts suggest using a different LLM — maybe even from a different provider — to conduct a code review, since each model has its own strengths and weaknesses.</p>
<h2 class="wp-block-heading" id="dont-forget-general-prompting-good-habits">Don’t forget general prompting good habits </h2>
<p>Whatever programming language you’re using, being clear will improve your outputs. Keep your requests targeted, and don’t expect the LLM to read your mind.</p>
<p>“Codex handles complex work better when you break it into smaller, focused steps,” OpenAI says in its <a href="https://developers.openai.com/codex/prompting" data-type="link" data-id="https://developers.openai.com/codex/prompting">developer docs</a>. “Smaller tasks are easier for Codex to test and for you to review. If you’re not sure how to split a task up, ask Codex to propose a plan.”</p>
<p>“Take time to make your prompts as relevant as possible, just as you would when helping a new teammate scope a task,” Google advises in its <a href="https://cloud.google.com/blog/topics/developers-practitioners/five-best-practices-for-using-ai-coding-assistants" data-type="link" data-id="https://cloud.google.com/blog/topics/developers-practitioners/five-best-practices-for-using-ai-coding-assistants">Five Best Practices for Using AI Coding Assistants</a>. “Consider what details you need to share for a person to succeed, and provide all those details to your AI tool.”</p>
<p>And, don’t load up your context window to the LLM’s published limit. Performance often degrades as you get close to the maximum token limit.</p>
<h2 class="wp-block-heading" id="take-advantage-of-open-llms-particularly-if-you-have-budget-and-token-limits">Take advantage of open LLMs, particularly if you have budget and token limits </h2>
<p>Open-weight models — especially ones small enough to run on a desktop computer — may never rival frontier LLMs like Claude, GPT, or Gemini. But for a lot of R coding, they don’t need to.</p>
<p>To see how well an LLM can write R code, it should have the same kind of harness that Claude Code, Codex, or Posit Assistant gives larger LLMs. “The harness is the product,” <a href="https://sidecar.ai/blog/the-ai-harness-why-whats-built-around-the-model-matters-more-than-the-model-itself" data-type="link" data-id="https://sidecar.ai/blog/the-ai-harness-why-whats-built-around-the-model-matters-more-than-the-model-itself">argues Mallory Mejias</a> at Sidecar, an AI education company. “The model is the engine inside it — important, but increasingly interchangeable.” I wouldn’t go that far, since I still find the LLM to be a critical piece of this equation, and models differ. But the same LLM will perform differently depending on the tools and context it has.</p>
<p>Posit recently added Google’s open-source Gemma 4 26B to Assistant’s existing options of larger, closed-source LLMs. “Up until this point, models of this size — small enough to run comfortably on high-end consumer laptops — were on our radar but not yet capable enough to drive an agent harness like Posit Assistant,” Posit senior software engineer Simon Couch wrote on <a href="https://posit.co/blog/gemma-4-new-budget-focused-model-posit-ai" data-type="link" data-id="https://posit.co/blog/gemma-4-new-budget-focused-model-posit-ai">the Posit blog</a>. “This has changed in the last few months with releases like Gemma 4.” However, to use it in Posit Assistant in RStudio, you still need to run it through a Posit AI subscription, not locally. It uses 1/10 the budget that the same session with Claude Sonnet would consume.</p>
<p>Several projects can use Claude Code to run local LLMs. <a href="https://github.com/ollama/ollama" data-type="link" data-id="https://github.com/ollama/ollama">Ollama</a>, open-source software for running LLMs locally, does this with the terminal command <code>ollama launch claude --model <model-name></model-name></code>, such as:</p>
<pre class="wp-block-code"><code>ollama launch claude --model gemma4:26b
</code></pre>
<p>The makers of <a href="https://github.com/unslothai/unsloth" data-type="link" data-id="https://github.com/unslothai/unsloth">Unsloth</a>, an open-source framework for running and training models, say running local LLMs inside Claude Code can be very slow due to <a href="https://unsloth.ai/docs/basics/claude-code#fixing-90-slower-inference-in-claude-code" data-type="link" data-id="https://unsloth.ai/docs/basics/claude-code#fixing-90-slower-inference-in-claude-code">a cache invalidation issue</a>. They say this can be fixed by adding <code>"CLAUDE_CODE_ATTRIBUTION_HEADER" : "0"</code> to ~/.claude/settings.json under “env”.</p>
<p>Unsloth can also use Claude Code to run local models, but it’s somewhat more involved to install and set up than Ollama. See <a href="https://unsloth.ai/docs/basics/claude-code#unsloth-tutorial" data-type="link" data-id="https://unsloth.ai/docs/basics/claude-code#unsloth-tutorial">this Unsloth tutorial</a>. </p>
<p>Open-source coding agent <a href="https://github.com/anomalyco/opencode" data-type="link" data-id="https://github.com/anomalyco/opencode">Open Code</a> runs LLMs from OpenAI, Anthropic, Google, Kimi, Alibaba’s Qwen, and others. I haven’t warmed up to it yet compared with commercially backed CLI tools I’ve tried, but it has 174,000 GitHub stars so clearly a lot of fans. </p>
<p>AI consultant Thomas Wiegold, who <a href="https://thomas-wiegold.com/blog/i-switched-from-claude-code-to-opencode/" data-type="link" data-id="https://thomas-wiegold.com/blog/i-switched-from-claude-code-to-opencode/">switched from Claude to Open Code</a>, said “OpenCode covers everything I need for my daily workflow. It’s fast, the provider flexibility is genuinely useful rather than theoretical, and the TUI is better for extended sessions…. That said, I’m not going to pretend it’s all smooth sailing.” He noted some stability issues and a remote code execution vulnerability earlier this year. </p>
<p>Wiegold’s suggestion? “Install both. Try others. Stay flexible.”</p>
<h2 class="wp-block-heading" id="in-summary">In summary</h2>
<p>Large language models remain imperfect and unpredictable tools, but they’re improving rapidly — as are the harnesses around them. Even frontier LLMs in commercial coding agents can ignore instructions at times and otherwise behave in unpleasantly surprising ways. However, you’ll vastly improve your chances of generating quality code if you use coding agents, take time to set them up with quality instructions, and remember good prompting techniques. Good luck!</p>
<p>—</p>
<h2 class="wp-block-heading" id="coding-agent-info">Coding agent info</h2>
<p><a href="https://www.anthropic.com/product/claude-code" data-type="link" data-id="https://www.anthropic.com/product/claude-code">Claude Code</a> – Anthropic’s coding agent is available in the terminal and as an IDE extension, a desktop app, mobile apps, Slack app, and a cloud version that can work directly in your repos on GitHub. Arguably the first breakthrough coding agent, it now includes an elegant remote-control option where you can start a session on your desktop and then continue it on a phone or tablet. While designed for Claude LLMs, you can use it to run local LLMs via tools like Ollama or Unsloth Studio. When using Claude models, it tends to offer less usage than other vendors’ options. Anthropic has posted the terminal version’s code on GitHub, with usage governed by Anthropic’s commercial terms of service. You can use Claude Code via API pay per use or as part of a Claude subscription.</p>
<p><a href="https://developers.openai.com/codex" data-type="link" data-id="https://developers.openai.com/codex">Codex</a> – OpenAI’s answer to Claude Code is available via terminal, app, IDE extension, or cloud, with integrations for GitHub, Slack, and Linear. The CLI tool is open source under an Apache license, <a href="https://github.com/openai/codex" data-type="link" data-id="https://github.com/openai/codex">available on GitHub</a>, and works via API pay per use or as part of a ChatGPT subscription.</p>
<p><a href="https://posit-dev.github.io/assistant/" data-type="link" data-id="https://posit-dev.github.io/assistant/">Posit Assistant</a> – The Posit (formerly RStudio) coding agent is designed for data work in R and Python, although Posit senior software engineer Simon Couch told me it would do fine for programming work for other uses and in other languages. It is available for the RStudio IDE (via a Posit AI subscription in RStudio), Positron IDE, and as a stand-alone CLI terminal app, although the CLI tool currently doesn’t feel as full-featured as the IDE integrations (that may change). As of this writing, you can also use API keys from Anthropic, OpenAI, and Snowflake Cortex and a GitHub Copilot account in Positron. Posit Assistant is being updated quite frequenty, Couch told me. If you tested it a month or two ago, it has more capabilities now.</p>
<p><a href="https://antigravity.google/product/antigravity-cli" data-type="link" data-id="https://antigravity.google/product/antigravity-cli">Antigravity CLI</a> – Google’s terminal-based tool will be replacing the Gemini CLI for unpaid and Google One users this month. There’s also an Antigravity IDE, SDK, and platform for orchestrating multiple agents. One advantage of Antigravity: Google offers a free plan with access to Gemini 3.5 Flash, Gemini 3.1 Pro, Gemini 3 Flash, Claude Sonnet 4.6, Claude Opus 4.6, and gpt-oss-120b, and what it describes as “generous” but undefined weekly rate limits based on “the degree we have capacity.”</p>
</div>
</div>
</div>
</div>
</div><p>The post <a href="https://www.azalio.io/10-tips-for-getting-better-r-code-from-your-ai-coding-agent/">10 tips for getting better R code from your AI coding agent</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Code like Hemingway</title>
		<link>https://www.azalio.io/code-like-hemingway/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Wed, 17 Jun 2026 09:59:34 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/code-like-hemingway/</guid>

					<description><![CDATA[<p>I was blessed with a terrific high school English teacher. Ms. Jewel was funny, kind, interesting, and tough on us. I can still spell “ecstasy” on the first try because of her. One of the more memorable lessons she taught was a “Hemingway and Fitzgerald” module.  I loved the short stories of both, but Hemingway’s work always stuck [&#8230;]</p>
<p>The post <a href="https://www.azalio.io/code-like-hemingway/">Code like Hemingway</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>
<div id="remove_no_follow">
<div class="grid grid--cols-10@md grid--cols-8@lg article-column">
<div class="col-12 col-10@md col-6@lg col-start-3@lg">
<div class="article-column__content">
<section class="wp-block-bigbite-multi-title">
<div class="container"></div>
</section>
<p>I was blessed with a terrific high school English teacher. Ms. Jewel was funny, kind, interesting, and tough on us. I can still spell “ecstasy” on the first try because of her. One of the more memorable lessons she taught was a “Hemingway and Fitzgerald” module. </p>
<p>I loved the short stories of both, but Hemingway’s work always stuck out for me. It’s cliché to say that Hemingway was terse, but we only say it because it is true. He could pack more into a page than most writers. He didn’t waste a syllable. </p>
<p>Software developers are soon going to have to take a lesson from Hemingway.</p>
<p>It’s not hard to be concise in code. You have to be, by design. The compiler won’t put up with any throat-clearing and jibber-jabber. You can’t meander. You have to use a very strict set of words and symbols, and you have to use them in a defined way. </p>
<p>Thanks to agentic development, <a href="https://www.infoworld.com/article/4096265/writing-code-is-so-over.html">we don’t write code anymore</a>. But we are writing for the agents. A lot. </p>
<h2 class="wp-block-heading" id="all-about-the-spec">All about the spec</h2>
<p>Let me explain. Claude Code, Codex, Copilot, and the rest all love to be “spoken” to <a href="https://www.infoworld.com/article/4146579/markdown-is-now-a-first-class-coding-language-deal-with-it.html">in Markdown</a>. We used to define our code with unit tests and specifications written for humans. Now, it’s all about the spec. And the spec needs to be both complete and concise. </p>
<p>It needs to be complete in the sense that if you leave something out or forget to define something, the agent will very likely fill in the gaps for you. Forget a feature or requirement, and the agent will go off confidently and almost certainly in the wrong direction.</p>
<p>At the same time, you need to be concise. Because if you are too effusive, you may give the agent ideas that you don’t want it to have. If you write like Fitzgerald — lush and expansive —  the agent will be off and running in a direction you can’t be sure about. </p>
<p>Our compilers will stop cold if we aren’t precise and concise. Our agents? They will proceed with confidence, diligently producing a result that looks great but isn’t what we want.</p>
<p>So we need to learn to write like Hemingway and pack as much meaning into as few words as possible. Or, as Paul Graham said:</p>
<div class="extendedBlock-wrapper block-coreImage undefined">
<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" src="http://www.azalio.io/wp-content/uploads/2026/06/paul_graham_tweet_08dac6.png" alt="paul graham tweet" class="wp-image-4185837" width="720" height="306" sizes="auto, (max-width: 720px) 100vw, 720px"></figure>
<p class="imageCredit">Foundry</p>
</div>
<h2 class="wp-block-heading" id="lets-be-clear">Let’s be clear</h2>
<p>For years, we’ve been writing semi-vague Jira tickets and expecting our fellow humans to fill in the gaps and “get the gist” of what we were trying to do. We don’t have that luxury anymore. Developers who aren’t able to move on from “Jira-speak” to “Agent-speak” are going to struggle. If we don’t follow Graham’s advice in writing those specs, who knows what we’ll get.</p>
<p>Maybe someday agents will know the ins and outs of our business well enough to “know what we mean,” but they can’t today. The time we used to spend coding now needs to be spent perfecting our new craft of writing precise specifications for our agents to execute.</p>
<p>Every <a href="https://en.wikipedia.org/wiki/Listicle">listicle</a> ever written on “How to be a Great Developer” says you need to be a great communicator — but that was communication with other humans. </p>
<p>Ms. Jewel always taught us that our writing is only as clear as our thinking. Our compilers forced us to write clearly. Our coding agents won’t do that. I’m sure there are many developers out there now who wish they had listened more closely to their high school English teacher.</p>
</div>
</div>
</div>
</div>
</div><p>The post <a href="https://www.azalio.io/code-like-hemingway/">Code like Hemingway</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>SpaceX’s planned $60 billion deal for Cursor raises questions for CIOs</title>
		<link>https://www.azalio.io/spacexs-planned-60-billion-deal-for-cursor-raises-questions-for-cios/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Tue, 16 Jun 2026 20:59:30 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/spacexs-planned-60-billion-deal-for-cursor-raises-questions-for-cios/</guid>

					<description><![CDATA[<p>When SpaceX on Tuesday officially announced its plan to purchase AI coding startup Cursor for $60 billion in stock, as it had predicted it would do in April, it presented CIOs and developers with a little good news, a little bad news and a massive pile of uncertainty.  The details of the proposed acquisition were [&#8230;]</p>
<p>The post <a href="https://www.azalio.io/spacexs-planned-60-billion-deal-for-cursor-raises-questions-for-cios/">SpaceX’s planned $60 billion deal for Cursor raises questions for CIOs</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>
<div id="remove_no_follow">
<div class="grid grid--cols-10@md grid--cols-8@lg article-column">
<div class="col-12 col-10@md col-6@lg col-start-3@lg">
<div class="article-column__content">
<section class="wp-block-bigbite-multi-title">
<div class="container"></div>
</section>
<p>When SpaceX on Tuesday officially announced its plan to purchase AI coding startup Cursor for $60 billion in stock, as it had <a href="https://www.infoworld.com/article/4161997/spacex-secures-option-to-acquire-ai-coding-startup-cursor-for-60b.html" target="_blank" rel="noopener">predicted it would do in April</a>, it presented CIOs and developers with a little good news, a little bad news and a massive pile of uncertainty. </p>
<p>The <a href="https://d18rn0p25nwr6d.cloudfront.net/CIK-0001181412/06fd909f-8de8-4960-aa6e-6b8cd3da6c9e.pdf" target="_blank" rel="noreferrer noopener">details of the proposed acquisition</a> were virtually identical to the terms announced in April, even retaining the $10 billion consolation prize for Cursor should SpaceX back out of the deal.  </p>
<p>But for CIOs and developers, the increasing probability of the deal happening is forcing them to make long-term decisions without many long-term answers about what is likely to happen to Cursor, which says its coding agents are used by 64% of Fortune 500 companies.</p>
<p>But whether the deal is good news for Cursor’s customers, or even those of its rivals, is an open question.</p>
<p><a href="https://my.idc.com/getdoc.jsp?containerId=PRF004946" target="_blank" rel="noreferrer noopener">Arnal Dayaratna</a>, research VP for software development at IDC, is firmly in the good news camp. He argued that the main element holding back the company, which he estimated brings in about $2 billion in annual revenue, was access to GPUs. And, he said, SpaceX’s xAI has the ability to resolve that problem.</p>
<h2 class="wp-block-heading" id="its-all-about-the-gpus">It’s all about the GPUs</h2>
<p>“The implication is that Cursor can get better because it will get better access to more compute, in the form of GPUs. That was what Cursor was struggling to obtain,” Dayaratna said, adding that Cursor joins both Anthropic and Google in aligning with xAI. </p>
<p>“It is bad news for Anthropic,” he said, pointing out, “[Elon] Musk figured out that GPUs were the limiters, the bottleneck. Not human talent: compute. And it’s not only about GPUs. It’s about the data centers,” as well as the mechanisms needed to support data centers, including electricity access, cooling apparatus, and zoning rights.</p>
<p>“Zoning rights are very difficult to obtain in the United States, and Elon has been able to leverage his political capital” to obtain those rights, Dayaratna said. </p>
<p>Dayaratna also commented on the highly unusual tactic of a company pre-announcing an acquisition and then announcing it “officially” two months later. The reason, he suggested, is that Musk didn’t want the acquisition to delay the SpaceX IPO, but he <em>did </em>want potential investors to know about it. “It gave the IPO more legs, even though it was not formally a part of the IPO. He wanted to socialize it before it happened,” he said.</p>
<p><a href="https://moorinsightsstrategy.com/team/jason-andersen/" target="_blank" rel="noreferrer noopener">Jason Andersen</a>, principal analyst at Moor Insights &amp; Strategy, took a far more cautious approach and predicted, “enterprise customers will be very concerned.”</p>
<h2 class="wp-block-heading" id="time-for-cio-due-diligence">Time for CIO due diligence</h2>
<p>“xAI’s models and treatment of guardrails are very different than what Cursor has stood for,” Andersen said, and the key issue involves model choice. “Will Cursor be able to point at models other than Grok? If that is the case, it could go towards helping customers be OK with things for a while,” he said. </p>
<p>But Andersen stressed the words “for a while.” He argued that Cursor rivals have gotten more sophisticated, which could give enterprise CIOs more alternatives.</p>
<p>“The tooling options from the classic enterprise vendors have gotten really good, and also have a number of features that enable better governance and teaming. For example, Kiro from AWS is already a worthy competitor to Cursor, but there have also been major improvements from Google and Microsoft this year,” Andersen said. “I’d venture to say that Cursor was already facing a substantial competitive challenge in the enterprise, especially as the vibe coding movement lost some steam towards agents that enable so much more developer capability.”</p>
<p>Andersen added, “A <em>lot</em> of SpaceX’s valuation is pinned to xAI and there wasn’t enough IP there for it to be long-term competitive with Anthropic or OpenAI without rapidly injecting new technology.”</p>
<p><a href="https://www.infotech.com/profiles/shashi-bellamkonda" target="_blank" rel="noreferrer noopener">Shashi Bellamkonda</a>, principal research director at Info-Tech Research Group, said the current period of due diligence gives CIOs opportunities to ask difficult questions.</p>
<p>“I do believe that enterprise customers will start to ask harder questions now, especially if more of the processing starts moving through Grok or xAI infrastructure,” Bellamkonda said, but he stressed that he does see a SpaceX deal helping Cursor.</p>
<p>“Cursor is already carrying a lot of LLM cost and if SpaceX and xAI can give it access to more compute and a faster, improved Grok model, customers could eventually get better performance at a lower cost. Grok is getting faster and more accurate, and under SpaceX there may be more enterprise appetite for xAI than xAI had on its own,” Bellamkonda said.</p>
<h2 class="wp-block-heading" id="questions-about-data-privacy">Questions about data privacy</h2>
<p>But Bellamkonda also pointed to the trust issue, specifically asking whether Cursor would be allowed to continue its zero-data-retention policy under the new ownership.</p>
<p>“Enterprise users with sensitive codebases will want to understand exactly where their data is going, who has access to it, and whether any prompts, code, metadata, or embeddings are touching SpaceX or xAI systems,” Bellamkonda said. “Zero data retention is only valuable if customers believe the controls still apply across the new infrastructure. If they can preserve Cursor’s privacy posture while using xAI and SpaceX compute to improve the product, this could be a very strong outcome for customers. If the data governance is unclear, enterprise buyers will hesitate.”</p>
<p><a href="https://acceligence.com/talent/profiles/justin-greis/" target="_blank" rel="noreferrer noopener">Justin Greis</a>, CEO of consulting firm Acceligence, echoed Bellamkonda’s concerns.</p>
<p>“For many enterprise customers, Cursor’s zero-data-retention policy was not simply a security feature. It was a foundational part of the procurement and approval process,” Greis said, pointing out that security teams, legal departments, compliance officers, and executive leadership all became more comfortable adopting AI-assisted development because they believed their source code, prompts, and proprietary intellectual property would not be retained, stored, or repurposed.</p>
<p>“Whenever ownership changes, customers naturally revisit those assumptions,” he said. “I would expect enterprise customers to immediately seek clarity on whether existing zero-data-retention commitments remain unchanged, whether those commitments continue to be contractually enforceable, and whether any future modifications could be introduced through changes to terms of service or product strategy.”</p>
<h2 class="wp-block-heading" id="reassessing-platform-risk">Reassessing platform risk</h2>
<p>Another concern is whether the move will create greater vendor concentration, and whether that will increase risk exposure. </p>
<p>“Many organizations are already concerned about becoming overly dependent on a small number of AI providers. If Cursor becomes more tightly integrated into a broader SpaceX technology ecosystem, some enterprises may worry about reduced flexibility and fewer alternatives. I suspect this will be a continued question with SpaceX’s recent IPO and growth plans,” Greis said. </p>
<p>Additionally, he pointed out, “AI coding platforms are rapidly becoming part of critical software delivery infrastructure. When organizations standardize on these tools, they are making a long-term platform decision that influences developer productivity, software quality, security processes, and engineering workflows. A change in ownership naturally prompts a reassessment of platform risk, roadmap alignment, and long-term strategic direction.”</p>
<p>As well, <a href="https://greyhoundresearch.com/svg/" target="_blank" rel="noreferrer noopener">Sanchit Vir Gogia</a>, chief analyst at Greyhound Research, noted, “zero data retention survives only where it stays contractual, auditable, enforceable and fenced from affiliate use. The deeper point is what Cursor has become. It is no longer a developer convenience. It sits inside the act of software creation, close to the intellectual-property bloodstream of the enterprise. That is a control plane, and control planes change hands rarely and consequentially.”</p>
<p>Gogia added that even before an acquisition, Cursor was already weakening its data guarantee. “Even now the promise is layered. Cursor’s own disclosures show that standard Privacy Mode still permits some code data to be stored for product features, while only the stricter legacy setting retains nothing. Zero is the exception, not the default.”</p>
</div>
</div>
</div>
</div>
</div><p>The post <a href="https://www.azalio.io/spacexs-planned-60-billion-deal-for-cursor-raises-questions-for-cios/">SpaceX’s planned $60 billion deal for Cursor raises questions for CIOs</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>[In preview] Public Preview: New project templates and template gallery for Azure Functions in VS Code</title>
		<link>https://www.azalio.io/in-preview-public-preview-new-project-templates-and-template-gallery-for-azure-functions-in-vs-code/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Tue, 16 Jun 2026 20:00:03 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/in-preview-public-preview-new-project-templates-and-template-gallery-for-azure-functions-in-vs-code/</guid>

					<description><![CDATA[<p>The Azure Functions extension for VS Code now provides a redesigned Create New Project experience in public preview, featuring a rich, visual template gallery that replaces the multi-step Quick Pick wizard with a searchable, filterable view of project tem</p>
<p>The post <a href="https://www.azalio.io/in-preview-public-preview-new-project-templates-and-template-gallery-for-azure-functions-in-vs-code/">[In preview] Public Preview: New project templates and template gallery for Azure Functions in VS Code</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>The Azure Functions extension for VS Code now provides a redesigned Create New Project experience in public preview, featuring a rich, visual template gallery that replaces the multi-step Quick Pick wizard with a searchable, filterable view of project tem</div><p>The post <a href="https://www.azalio.io/in-preview-public-preview-new-project-templates-and-template-gallery-for-azure-functions-in-vs-code/">[In preview] Public Preview: New project templates and template gallery for Azure Functions in VS Code</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>[Launched] Generally Available: Log Analytics Summary Rules experience</title>
		<link>https://www.azalio.io/launched-generally-available-log-analytics-summary-rules-experience/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Tue, 16 Jun 2026 18:00:02 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/launched-generally-available-log-analytics-summary-rules-experience/</guid>

					<description><![CDATA[<p>A new Azure portal experience for Summary Rules in Log Analytics is now available. Summary Rules enable aggregation of high-volume log data at a defined cadence and store it in summarized tables for improved query performance, reporting, or enhanced data</p>
<p>The post <a href="https://www.azalio.io/launched-generally-available-log-analytics-summary-rules-experience/">[Launched] Generally Available: Log Analytics Summary Rules experience</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>A new Azure portal experience for Summary Rules in Log Analytics is now available. Summary Rules enable aggregation of high-volume log data at a defined cadence and store it in summarized tables for improved query performance, reporting, or enhanced data</div><p>The post <a href="https://www.azalio.io/launched-generally-available-log-analytics-summary-rules-experience/">[Launched] Generally Available: Log Analytics Summary Rules experience</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Databricks pitches LTAP as a new foundation for agentic applications</title>
		<link>https://www.azalio.io/databricks-pitches-ltap-as-a-new-foundation-for-agentic-applications/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Tue, 16 Jun 2026 13:59:34 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/databricks-pitches-ltap-as-a-new-foundation-for-agentic-applications/</guid>

					<description><![CDATA[<p>As enterprises rush to build AI agents that can reason over business data and take action, Databricks argues that the long-standing practice of separating operational and analytical data systems is turning into a liability. That separation, the cloud-based data warehouse provider says, is becoming increasingly strained as AI agents require simultaneous access to live operational [&#8230;]</p>
<p>The post <a href="https://www.azalio.io/databricks-pitches-ltap-as-a-new-foundation-for-agentic-applications/">Databricks pitches LTAP as a new foundation for agentic applications</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>
<div id="remove_no_follow">
<div class="grid grid--cols-10@md grid--cols-8@lg article-column">
<div class="col-12 col-10@md col-6@lg col-start-3@lg">
<div class="article-column__content">
<section class="wp-block-bigbite-multi-title">
<div class="container"></div>
</section>
<p>As enterprises rush to build AI agents that can reason over business data and take action, Databricks argues that the long-standing practice of separating operational and analytical data systems is turning into a liability.</p>
<p>That separation, the cloud-based data warehouse provider says, is becoming increasingly strained as AI agents require simultaneous access to live operational data and historical context to make decisions and take actions in real time, unlike humans, who traditionally can work with data that is minutes or hours old.</p>
<p>At its annual Data + AI Summit, the data warehouse provider introduced Lake Transactional and Analytical Processing (LTAP), a new architecture designed to unify transactional and analytical data on a single storage layer.</p>
<p>The new approach, according to Databricks, differs from traditional <a href="https://www.infoworld.com/article/2334535/what-is-oltp-the-backbone-of-ecommerce.html">online transaction processing (OLTP)</a> and <a href="https://www.infoworld.com/article/2334471/what-is-olap-analytical-databases.html">online analytical processing (OLAP)</a> architectures, which typically store operational and analytical data in separate systems.</p>
<p>Traditionally, OLTP databases are optimized for running day-to-day business operations such as order processing, payments, and inventory updates, while OLAP systems are designed for large-scale analytical queries and reporting.</p>
<p>As a result, enterprises often need to rely on <a href="https://www.infoworld.com/article/3487711/the-definitive-guide-to-data-pipelines.html">ETL pipelines</a>, data replication, and separate infrastructure to move information between the two environments.</p>
<p>LTAP, Databricks said, seeks to eliminate the reliance on ETL pipelines, replicated databases or separate data copies by storing data once in a shared <a href="https://www.infoworld.com/article/2334907/review-databricks-lakehouse-platform.html">lakehouse</a> layer while allowing dedicated compute engines to handle transactional and analytical workloads independently.</p>
<p>This approach, the company argued, provides AI-driven agents and applications access to both live operational data and historical analytical context without requiring data movement or duplicate copies.</p>
<h2 class="wp-block-heading" id="developer-simplicity-in-the-agentic-era">Developer simplicity in the agentic era</h2>
<p>Analysts, too, agree with Databricks’ contention that AI agents place new demands on enterprise data architectures.</p>
<p>“Agents don’t behave like people, or even like the apps we built for people. They read for context, loop, try things, then write something back, thousands of times over in ways you can’t fully predict. At that volume, the constant bouncing between production and analytics systems starts becoming the bottleneck. The pressure to collapse that gap is real, and LTAP is one way to approach it,” said <a href="https://moorinsightsstrategy.com/team/mike-leone/" target="_blank" rel="noreferrer noopener">Michael Leone</a>, principal analyst at Moor Insights and Strategy.</p>
<p><a href="https://www.linkedin.com/in/bhupendrachopra/" target="_blank" rel="noreferrer noopener">Bhupendra Chopra</a>, cofounder and CRO at IT consulting firm Kanerika, pointed out that an autonomous agent’s data access pattern makes the traditional architectures brittle: “We’re seeing this directly with clients deploying multi-agent systems, the pipeline layer becomes the ceiling almost immediately as an agent runs hundreds of times per task.”</p>
<p>The analysts also pointed out that the ability to collapse the gap between OLAP and OLTP is likely to help developers design more robust AI agents or applications that enterprises are currently targeting to deploy.</p>
<p>“The most interesting workflow or application patterns are real-time, context-aware applications that combine transactions, analytics, and AI in one flow,” said <a href="https://www.linkedin.com/in/slwalter/" target="_blank" rel="noreferrer noopener">Stephanie Walter</a>, practice leader of AI stack at HyperFRAME Research.</p>
<p>“Examples include AI agents that update customer workflows while seeing historical account context and fraud systems that act on live transactions and long-term behavioral patterns,” Walter added.</p>
<p>Designing such applications today, however, according to Leone, would require developers to pull together data from transactional systems, data warehouses, vector databases, and other sources through custom integrations, creating significant engineering complexity and maintenance overhead.</p>
<h2 class="wp-block-heading" id="operational-simplicity-and-governance-gains-for-cios">Operational simplicity and governance gains for CIOs</h2>
<p>For CIOs,  LTAP’s ability to reduce that engineering complexity, according to <a href="https://www.hfsresearch.com/team/ashish-chaturvedi/" target="_blank" rel="noreferrer noopener">Ashish Chaturvedi</a>, leader of executive research at HFS Research, will result in operational simplicity as well as cost savings.</p>
<p>“Most prominent advantage would be fewer data pipelines and everything that cascades from eliminating them. Most enterprises don’t realize how much of their data engineering budget is pure plumbing maintenance,” Chaturvedi said.</p>
<p>Kanerika’s Chopra pointed out that a substantial portion of data engineering capacity in mid-to-large enterprises today is consumed by maintaining synchronization between transactional and analytical systems.</p>
<p>The implications, however, Chaturvedi noted, are not limited to developer productivity, architectural simplicity, or cost savings: “The strategic prize is simplified governance. When you have one copy of data under one governance model instead of the same data scattered across operational stores, replicas, warehouses, and vector databases, you’ve solved the governance fragmentation problem.”</p>
<p>That simplification, according to Chopra, will matter operationally for enterprises deploying multiple AI agents, as these workflows can amplify governance gaps at a speed and scale that no human workflow ever did.</p>
<h2 class="wp-block-heading" id="ltap-versus-htap">LTAP versus HTAP</h2>
<p>Despite all its benefits, though, LTAP isn’t the first effort to unify operational and analytical workloads under a single architecture and for years.</p>
<p>The industry has pursued a similar goal through <a href="https://www.infoworld.com/article/2260125/how-in-memory-computing-drives-digital-transformation-with-htap.html">Hybrid Transactional and Analytical Processing (HTAP)</a> architecture, which sought to combine operational and analytical workloads on tightly coupled infrastructure to serve both workload types from the same system.</p>
<p>LTAP, in contrast, separates storage from compute, allowing different engines to access a common data layer while remaining independently scalable, Databricks said.</p>
<p>That separation of compute engines is why analysts think that LTAP might be a better bet than HTAP.</p>
<p>“HTAP never took off because asking one tightly bound system to be great at transactions and great at analytics usually left it mediocre at both, so customers ended up paying a premium for that compromise,” Leone said.</p>
<p>“I think separating storage from compute is the right instinct, and it’s the same move that made the modern cloud data world work in the first place. It matters because the thing that sank HTAP was one workload starving the other, and giving each side its own dedicated engine is exactly how you keep that from happening,” Leone added.</p>
<p>Another reason for HTAP’s failure, according to <a href="https://isg-one.com/about-us/people/david-menninger">David Menninger</a>, executive director of software research at ISG, was its requirement for enterprises to replace existing data platforms with a new architecture.</p>
<p>LTAP, by contrast, builds on the now-common practice of separating compute and storage, making the addition of an operational layer less of an architectural transformation and potentially lowering the barrier to adoption, Menninger added.</p>
<h2 class="wp-block-heading" id="not-yet-the-default-architecture-for-ai-agents">Not yet the default architecture for AI agents</h2>
<p>However, despite the enthusiasm around LTAP, analysts warned CIOs against viewing this as the inevitable successor to existing data architectures.</p>
<p>“CIOs will still need to choose their data architecture based on latency, reliability, ecosystem fit, cost, compliance, and developer experience,” Walter said.</p>
<p>Echoing Walter, Chaturvedi pointed out that for LTAP to become the de facto standard for the industry, Databricks will need more than architectural elegance: “The architecture looks sound on paper. The proof will be in the commit-to-query latency numbers under real load.”</p>
<p>LTAP, Databricks said, is expected to be released soon as part of <a href="https://www.infoworld.com/article/4007541/databricks-data-ai-summit-2025-five-takeaways-for-data-professionals-developers.html">Lakebase,</a> without providing any specific timelines.</p>
</div>
</div>
</div>
</div>
</div><p>The post <a href="https://www.azalio.io/databricks-pitches-ltap-as-a-new-foundation-for-agentic-applications/">Databricks pitches LTAP as a new foundation for agentic applications</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
		<item>
		<title>Nvidia PCs don’t need cloud for AI</title>
		<link>https://www.azalio.io/nvidia-pcs-dont-need-cloud-for-ai/</link>
		
		<dc:creator><![CDATA[Azalio tdshpsk]]></dc:creator>
		<pubDate>Tue, 16 Jun 2026 09:59:45 +0000</pubDate>
				<category><![CDATA[Cloud]]></category>
		<guid isPermaLink="false">https://www.azalio.io/nvidia-pcs-dont-need-cloud-for-ai/</guid>

					<description><![CDATA[<p>Nvidia’s new RTX Spark is one of the most interesting personal computing announcements in years. That’s because it’s not just another PC platform, but tries to redefine the role of the personal computer in the age of AI. Announced at Computex 2026, RTX Spark is Nvidia’s new platform for slim Windows laptops and compact desktops, [&#8230;]</p>
<p>The post <a href="https://www.azalio.io/nvidia-pcs-dont-need-cloud-for-ai/">Nvidia PCs don’t need cloud for AI</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></description>
										<content:encoded><![CDATA[<div>
<div id="remove_no_follow">
<div class="grid grid--cols-10@md grid--cols-8@lg article-column">
<div class="col-12 col-10@md col-6@lg col-start-3@lg">
<div class="article-column__content">
<section class="wp-block-bigbite-multi-title">
<div class="container"></div>
</section>
<p>Nvidia’s new RTX Spark is one of the <a href="https://www.nvidia.com/en-us/products/rtx-spark/">most interesting personal computing announcements</a> in years. That’s because it’s not just another PC platform, but tries to redefine the role of the personal computer in the age of <a href="https://www.infoworld.com/article/4061121/a-brief-history-of-ai.html">AI</a>. Announced at Computex 2026, RTX Spark is Nvidia’s new platform for slim Windows laptops and compact desktops, designed to combine an Arm-based CPU, Blackwell-based RTX graphics, and a large, unified memory architecture into a single AI-first computing system.</p>
<p>We have all grown accustomed to a cloud-centric AI model over the past few years. We open an application, send a request over the network, and a hosted service in a distant data center provides the intelligence. ChatGPT, Grok, Gemini, and similar systems have trained the market to think of AI as something that lives elsewhere. RTX Spark proposes a different model. It asks a simple yet disruptive question: What if the model, the agent, the data, and the application could all live on your own machine? Nvidia is not just selling a faster PC. It is selling a new architectural premise.</p>
<h2 class="wp-block-heading" id="features-functions-and-prices">Features, functions, and prices</h2>
<p>On paper, RTX Spark is designed to be a highly capable local AI system. Nvidia has described the platform as combining AI acceleration and RTX graphics on a single chip for thin laptops and small desktops. Public specifications for the platform indicate configurations with up to 6,144 Blackwell GPU cores, up to a 20-core CPU, up to 1 petaflop of FP4 AI performance, and up to 128GB of unified memory. These are not ordinary PC numbers. They are clearly intended to support serious local AI workloads.</p>
<p>The unified memory approach is especially important. In traditional PC architecture, the CPU and GPU often use separate memory pools, which can become a bottleneck when running large models. By contrast, RTX Spark’s design is intended to make it easier for the system to host and run AI models locally. This enables Nvidia to position the machine as capable of hosting persistent <a href="https://www.infoworld.com/article/3611465/how-ai-agents-will-transform-the-future-of-work.html">AI agents</a>, supporting local inference, and even allowing users to customize or fine-tune certain classes of language models.</p>
<p>Nvidia is also careful not to frame the system as only an AI box. In a smart move, the company is marketing RTX Spark for gaming, creative applications, AI development, and agentic workflows. This has been designed not as a one-trick pony, but as a capable computer first and an AI workstation second. Otherwise, it remains a niche developer experiment.</p>
<p>Pricing remains uncertain because Nvidia hasn’t published a universal price for every RTX Spark laptop or desktop. The platform will appear in products from different manufacturers, which means prices will vary. The best indicator comes from the related DGX Spark desktop, listed at about $4,699, though early estimates placed it between $2,999 and $3,999.</p>
<p>That probably gives us the right way to think about pricing for this broader category. These are unlikely to be inexpensive mainstream PCs, at least not at launch. They are more likely to arrive as premium systems aimed at developers, technical professionals, creators, and early adopters willing to pay for high-end AI capabilities on the device. Over time, that may broaden. For now, however, this looks like a new high-value, high-cost category rather than a commodity PC refresh.</p>
<h2 class="wp-block-heading" id="what-is-its-real-purpose">What is its real purpose?</h2>
<p>The most important thing about RTX Spark is not the chip. It is the purpose behind the chip. This machine is ultimately built to run AI agents locally, and that is a bigger deal than it may seem at first glance. An AI agent is more than a chatbot. It persists state, accesses tools, works across applications, remembers context, automates tasks, and increasingly acts as a software-based worker. Nvidia is explicitly positioning Spark systems to run personal AI agents directly on the local machine, potentially around the clock. That creates a very different computing model from what most of us use today.</p>
<p>There is another important layer to this story. These systems are also being positioned as platforms on which users can build and run smaller, more limited, locally tuned versions of large language model systems. Put plainly, you may be able to create your own model-based assistant that runs directly on the RTX Spark. It will not be as broadly capable as a frontier model operated by OpenAI or another hyperscaler. It is likely to be less generally capable, narrower in its expertise, and more constrained by local hardware limits. But it will be yours, it will be local, and it will respond without relying on a remote <a href="https://www.infoworld.com/article/2269032/what-is-an-api-application-programming-interfaces-explained.html">API</a> call to a hosted AI service hundreds or thousands of miles away.</p>
<p>Such a shift is conceptually significant. For years, the AI industry has conditioned us to believe that serious intelligence must be centralized. RTX Spark suggests a future in which at least some intelligence becomes personal, portable, and self-contained.</p>
<h2 class="wp-block-heading" id="welcome-to-the-revolution">Welcome to the revolution</h2>
<p>The breakthrough here is not that local models will instantly outperform remote models. They will not. But the architecture of AI use may begin to diversify. Today, the default assumption is centralization. We assume the model, the knowledge base, and the application stack will all live in the cloud, and the user is simply a client. With systems like RTX Spark, that assumption starts to weaken. The model can run on the local machine. The agent can run on the local machine. Sensitive data can remain on the local machine. The application logic can be executed on the local machine. This changes latency, privacy, resiliency, and cost models. It also changes who controls the AI.</p>
<p>That does not mean the cloud goes away. Far from it. Enterprise use cases that benefit from centralized models and data will continue to exist. Businesses want the same knowledge base, business rules, database consistency, and governance model available to everyone. Centralization remains powerful because it reduces fragmentation and keeps systems aligned. Yes, a single-tenant, RTX Spark-based AI environment can be useful for certain projects, but it can also create islands of intelligence that do not easily share knowledge across teams and systems.</p>
<h2 class="wp-block-heading" id="possible-use-cases">Possible use cases</h2>
<p>I see the strongest potential use cases in disconnected or semi-disconnected environments. Think about physicians doing diagnostics support in privacy-sensitive contexts, field engineers collecting and interpreting data in remote areas, military and public sector users operating at the edge, or professionals who need highly private, self-contained AI assistance without relying on constant connectivity. In those scenarios, the value proposition is very strong. Having the model, data, application, and agent all on one portable system is not a limitation. It is the point.</p>
<p>The bigger question is whether mainstream enterprise AI will migrate in that direction. I remain skeptical that most organizations want hundreds or thousands of individually tuned, locally hosted models to replace centralized AI services. I predict that this category will complement the cloud rather than displace it. The more likely future is hybrid: centralized AI where shared knowledge and governance matter, and local AI where privacy, portability, latency, or disconnected operations matter more.</p>
</div>
</div>
</div>
</div>
</div><p>The post <a href="https://www.azalio.io/nvidia-pcs-dont-need-cloud-for-ai/">Nvidia PCs don’t need cloud for AI</a> first appeared on <a href="https://www.azalio.io">Azalio</a>.</p>]]></content:encoded>
					
		
		
			</item>
	</channel>
</rss>
