The hidden threat to AI performance

Most of us involved with AI are aware (or are quickly becoming aware) that memory bandwidth isn’t keeping pace with advancements in processing power. This imbalance creates a frustrating situation where GPUs are often underutilized, wasting compute power just as AI...

When it comes to AI, bigger isn’t always better

Enterprise AI tends to default to large language models (LLMs), overlooking small language models (SLMs). But bigger isn’t always better. Often, a smaller, more specialized model can do the work faster and more efficiently. What complicates things is that neither an...