Category Archives: AI

Claims of Complete Gen-AI Auditability Are Complete BullCr@p

Proponents of Gen-AI will argue that you should go all in on their next-gen LLMs because, unlike current systems and many humans (who are lousy keepers of record), their decisions, like their actions, are 100% auditable. And, again, that’s complete and utter bullcr@p.

Just because you can ask the LLMs to output their reasoning, and you can ask them to log everything they do from the minute you start the interaction, but, because the reasoning is all based on probabilistic math at a scale NO human can understand (and for which we have NO measurements yet), you have no idea WHY the LLM reasoned a certain something or IF the Gen-AI will reason the same way on the same request, even if that request is re-iterated only 5 minutes later!

You can simply search the internet for hundreds of examples out there of people giving the exact same prompt to the exact same LLM AI five minutes later and getting a slightly to completely different response.

Gen-AI LLMs don’t understand. They don’t actually reason. And they definitely don’t think! That’s why they are NOT auditable. And that’s also why they should NEVER make a decision. (However, since they can analyze more data, and for some tasks have, more often than not, achieved a competence beyond an average human happy to regress in IQ to the late neolithic era, they should definitely suggest decisions with their reasoning. But the LLMs should never, ever, execute on those decisions without human approval.)

The reality is that only an LLMs ability to log what was done in an immutable blockchain format is useful compared to an employee who knowingly did something wrong for a bad reason. Since the AI is not intelligent, and doesn’t have ethics, it has no reason NOT to log its reasoning and why an action was taken. But, as per above, the LLM is still NOT auditable.

If You Have Two “AI” “Agents” Talking to Each Other …

… then, as Stephen Klein of Curioser.AI points out, you have a puppet show, “except instead of sock puppets, we’re using large language models and API loops”!

Just because it happens autonomously, looks social, appears to have an identify, and fakes a dialogue, it doesn’t mean there is anything more to it than the modern equivalent of a puppet show.

Gen-AI is the ultimate show and if P.T. Barnum were alive today, it would be his ultimate circus. But unlike the scarecrow, it doesn’t have a brain. It may have the ability to harness more compute power and data than any algorithm we have developed to date, but it is still dumber than a pond snail.

It has very few valid uses. I’ve discussed some of them before, but let’s make it perfectly clear what little it can actually do:

  • natural language processing — and, properly trained, it can not only equal, but even exceed the best last generation tech in semantic and sentiment processing
  • large corpus search — while it will never be 100% accurate, it can find just a few potentially relevant documents among millions with few false positives and negatives
  • large corpus summarization — again, while it will never be 100% accurate, and most good summaries won’t be top tier, it can summarize large amounts of data, and usually extract just the relevant data in response to your query
  • idea retrieval — not generation, retrieval of ideas based on a review and summarization of petabytes of data; very relevant for users dependent on LLMs who are suffering minor to severe cognitive atrophy; with proper prompting this can take the form of
    • strategy / workflow suggestion
    • devil’s advocate
  • usage and workflow prediction during application development
  • rapid PROTOTYPE generation for usability and efficacy analysis
    (not enterprise application development)

The reality is that Gen-AI

  • cannot reason,
  • is not deterministic, and
  • is essentially nothing more than a meta-prediction engine;
  • is providing ideas based on meta-pattern identification,
  • is predicting based on a layered statistical model beyond ANY human understanding, and
  • generates code riddled with security issues and possibly even boundary errors;
  • and let’s not ignore the fact that hallucinations are a core function that CANNOT be trained out .

This means that often the only way to succeed with Gen-AI is to more-or-less abandon Gen-AI LLMs in production applications except as Natural Language Parsers (as they are easier to train to accuracy levels beyond last generation semantic parsers which could take months to train to high effectiveness — and I know this from personal experience), and revert back to the AI tech that was just reaching maturity and industrialization readiness that I was writing about in the late 2010s. The reality is that, if you are willing to use some elbow grease and put the hours in, you can create spectacular applications with last-generation tech, and then use Gen-AI as a natural language interface layer to simplify utilization, integration, and complex workflows. If you are willing to create the right guardrails, where the Gen-AI LLM can only trigger specific application services with specific data in specific contexts, with HUMAN approval, then you can use it responsibly. Otherwise, it’s a crapshoot as to the results you’ll get.

For example, you should never use it for negotiation, which can be as much as reading the other person, as this is a very risky application as the number of soft-based data points you need for a decent prediction typically far outnumbers what you have available … even for public figures where you believe you have lots and lots of data on them available to judge their reactions. But hey, if you want to lose your lunch money, and possibly your entire bank account, go ahead and let it act as your buyer (but if it can lose hundreds powering a vending machine, imagine how much it can lose on a seven to nine figure category).

Even though plenty of vendors will provide some very convincing demos that seem to indicate Gen-AI LLMs can do otherwise, don’t fall for the tricks. During the demo, The Wizard of Oz is hiding behind the curtain. The not-so-great thing about LLMs is that, for a very specific set of tasks/situations, they can be overtrained on a very specific corpus to over-perform against those tasks and greatly increase the chances that any demo they deliver to you works fantastically well.

However, what this also means, is that you definitely do not want to use the Gen-AI LLM for tasks that are quite distinct and significantly different than the tasks/situations the Gen-AI LLM was over-trained for as the Gen-AI LLM is going to perform quite poorly at best, and possibly quite disastrously at worst. The reality is that once the puppeteer is no longer pulling the strings, all bets as to efficiency and effectiveness are off.

The Gen-AI ringmasters are employing the same philosophy and same techniques that made some of the early spend auto-classification providers “leaders” with unheard of success rates compared to when the average organization employed similar auto-classification tech and got dismal results. (Because they just didn’t know what “AI” actually stood for!)

Don’t be fooled by the ringmasters. If you want results, lie its AI and buy solutions that work.

AI vs No AI – Let’s Make This Clear!

There are valid uses for AI, and valid AI models you should use. (LLMs are rarely one of them, having only a handful of reliable applications, but, when you push them aside, there are lots of other AI technologies that actually work if you don’t get blinded by the hype.) But there are invalid cases, and AI models you shouldn’t use. So to make it easy-peasy for you, here’s a simple guide!

USE AI WHEN DON’T USE AI
It’s a well constrained use-case where AI has been successfully deployed in industry, where the confidence is proven, and where you have access to the right technology. It’s a poorly defined use case, AI has not yet been successful for the use case, the confidence is unproven, and/or the tech you have access to is still in alpha!
It works as well as previous gen tech but with significantly shorter training cycles and easier integration and utilization. Previous gen tech still works better, costs less, and/or has no impediments to integration or UX.
It can bring value beyond what last generation tech can bring. All of the value can be achieved using traditional rules-based (A)RPA, (decision) optimization, analytics, or (classical) machine learning.
It’s a cost effective solution that can be run predictably based on a predictable cost model. It’s based (primarily) on LLM(-based) models that have unpredictable compute costs and, with the wrong request, can eat up thousands of dollars on a single request.
You have a valid use case for agentic tech! You think you have a valid usre case for agentic tech. (If you think you’re ready for AI, you’re NOT ready for AI.)
You’ve mastered current generation tech. You’re still a generation (or three) behind on tech.
You have in-house expertise on what AI is, and isn’t; where it can, and can’t be successfully deployed; and what “AI” is typically appropriate in a given situation. You’re relying entirely on (junior) consultants from the Big X promising it’s gonna “change your life“.
It’s designed to augment human performance and make your employees more productive and more effective super humans (able to do the work of 3, 5, 7, and even 10 regular humans). It’s designed to replace humans. (This doesn’t mean it can’t reduce the number required to do a task, just that at least one is still maintained to handle exceptions and make decisions.)
The firm is selling augmented intelligence! The firm is selling AI Employees. (There are none! And any firm that makes this claim is dehumanizing your employees! But hey, it’s your choice if you want to lose all your money.)

Get it yet?

There are MANY reasons you are NOT ready for AI!

A few weeks ago, we told you that if you think you’re ready for AI, you’re not ready for AI because, even though the vast majority of you are chasing AI, only a minority of you are ready to even investigate it. And we mean investigate, not use. That depends on whether or not there are any relevant AI solutions for you needs — and despite the repeated BS claims by the big AI vendors, there may not yet be any!

And it’s not just because you haven’t

  • admitted you’re only chasing AI because of FOMO and FUD
  • assessed where you are
  • realized you are generations of tech behind
  • determined you just don’t have the right resources

But it goes beyond that.

In order to have any hope of succeeding with AI:

You need great data and great Master Data Management
… but you don’t even know where your data is! You have no governance policies, no management processes to ensure data is kept up to date (or even backed up unless you have already suffered a data loss and determined losing that specific data would be disastrous), and no clue about what that entails. And even if you realize that you need (master) data management, you won’t get the C-Suite to sign off on it, even if you call it E-MDMA and tell them they’re getting free samples!
You need a good IT infrastructure, with context-based integration and workflow capability
… but you have no central strategy for data integration, system orchestration, or enterprise workflows, and your IT infrastructure is whatever cloud your ERP runs on. AI, especially Gen-AI, requires massive data and massive compute and, guess what, that requires massively powerful, solid, infrastructure — and yours is probably held together with spit, glue, and duct tape!
You need an in-depth understanding of not only the problem you want to solve, but what AI algorithm will actually work reliably and with measurable confidence
… but guess what? In order to properly evaluate AI, you need an advanced understanding of the technology, which usually requires an advanced, graduate level, understanding of the underlying mathematics as well as deep understanding of the problem and how to mathematically model it.
You need a strong technical quotient (TQ) to implement, train, and verify those AI algorithms
… and that’s more than just a single expert who can evaluate, but a strong bench of architects and developers to make it work — you can’t rely solely on the vendor as they can go away, their bench can leave, or they can get pressured by their investors to just sell, sell, sell (and pretend you don’t exist once they get the cheque) and that leaves you to your own skillsets.
You need domain experts on hand to verify the results
… and this goes double for critical results. If you are using an augmented intelligence to help with sourcing, market analysis, strategy recommendations, etc. you can’t let an agentic system execute on a computation without verifying it. No system ever has all the data, no system ever knows all of the options, and no system has the soft information (and how you might be able to work a sales rep to your advantage). And if someone messed up the data, considering just one wrong number can entirely throw off a hundred thousand variable model, you’re in deep doo-doo if the system executes an order without your verification.
You need to redesign your processes to optimally take advantage of AI
… because your processes come from the time before office machines existed, so obviously they weren’t designed for modern technology. And while traditional workflow / RPA can easily automate what you have (even though it shouldn’t), since AI requires good data, good structure, properly designed models, etc. — it’s not going to work with whatever Guilded Age process you’re using now.

And so on. The reality is, despite what all the big vendors, big consultancies, and big analyst firms tell you — you’re just not ready for AI. (And definitely NOT ready for big bang projects that will end in big busts!) It’s just the latest silicon snake oil panecea — like all purpose predictive analytics, the fluffy magic cloud, SaaS, and the World Wide Web and every other panacea that has come before. (Just remember the last time silicon snake oil was hyped this much, it resulted in the dot com bust!)

Dangerous Procurement Predictions Part IV

As per our first three posts, if you read my predictions post, you know SI hates predictions posts. It fully despises them because the vast majority of these posts are pure optimistic fantasy and help no one. Why are the posts like this? Because no one wants to hear the sobering reality off of the bat in the new year and the influencers care more about clicks than actually helping you.

But the predictions are not only bad, they’re dangerous if you believe them. So we are continuing to lay bare the reality of the situation to make sure you understand that this year isn’t much different than last year, no miracles are coming, and only hard work and the application of your human intelligence are going to get you anywhere. Today we tackle the next set, and we hope we’re at the end of the series, but if we stumble across more bad predictions, we’ll have to do a Part V. But we hope not!

11. Negotiation gets productized.

Here’s the thing, in a few niche industries like electronics, we have a few niche players like Levadata that bundle “should-cost” + playbooks + concession sequencing for experienced buyers to help them leverage the state of the market for the best results possible. But they’re hardly used relative to the total electronic market size, as they are used mainly by component buyers / manufacturers, not consumers of such tech (to understand the manufacturer’s margins).

Similar offerings don’t exist across most industries. And even if they did, most buyers are not sophisticated enough to do this. Most struggle with a multi-round RFX, yet alone detailed should-cost/target cost models, negotiation playbooks (which have to cover all standard market conditions and unique situations), and the concept of BATNA, especially relative to offers and counter-offers in a structured concession sequence.

Without these domain relevant niche offerings and career negotiations trained in deep tech, which are both few and far between, this is not going to happen. And Artificial Idiocy certainly isn’t going to fill the gap!

12. AI As a “Governance” Engine.

The claim: When you design them well, agents encode judgment, compliance and brand values into every transaction. Uhm, no! At least not if they are Gen-AI agents that can’t judge (as they can’t even reason), may or may not execute compliant with regulations, and will happily screw a supplier (by refusing to pay an invoice) or customer (by refusing to honour a claim) if it thinks that’s what it needs to do to make you happy or stay turned on (because it was told to find savings of 500K and it’s calculations determine that paying certain invoices or honouring certain claims will not allow that savings goal to be met, if it was even possible when the AI told you it was as it may have arbitrarily multiplied a calculation by -1 just to make the math work).

Governance, by definition, requires the act of governing. And governing, by definition, requires the wisdom as well as the authority to conduct the affairs of the organization. And only truly intelligent beings (i.e. HUMANS) can acquire wisdom over time.

13. There will be no more “X” employees because AI will replace them all!

First of all, how many times do we have to repeat that there are NO AI Employees, you shouldn’t believe the degrading, demeaning, and, frankly, dehumanizing claims, and that you definitely DO NOT want Agentic Buying through fake AI Employees. Secondly, it can’t even do the basic tasks that even the dumbest drunken plagiarist intern can do on a daily basis. But let’s not digress too far before giving you the major examples.

Claim #1: Contract Administrator / Staff Attorney

THE PROPHET has been trying to Kill ALL the Lawyers for quite some time now, and it seems he’s not alone.

But here’s the thing. While AI systems are pretty good (and as good as the drunken plagiarist interns) at spotting grammar errors, redlining against standard clauses, pointing out missing clauses in most organizational contracts, etc., they aren’t good at everything. They can’t identify unaddressed risks without being told what those risks are, they can’t judge the full extent of liability without understanding what those liabilities could be, and they can’t judge the supply geo-political and supply chain risks without broader context.

Plus, they can’t always back up their suggestions; often make up case law, case decisions, and authors; and can’t always judge the requirements of potentially relevant regulations. And we’ve seen many times what happens when even trained lawyers use AI — they get lazy, fall for the slop, get reprimanded and fined by judges tired of the laziness (with a recent example happening in November in Mata v. Avianca, Inc). The previous link also lists three other notable cases where lawyers (and their firms) were fined and sanctioned, but, by now, there are dozens!

But hey, go ahead and replace your lawyer, write bad contracts, make decisions on fake case law, and risk your entire business if you want to. (If you want to, it’s probably safe to go ahead and get rid of the intern who does the redlining and the clerk that does the filing, the AI is probably just as good at that, but do not ever, ever replace a real qualified lawyer with a piece of sh!t “AI”.)

Claim #2: Spend Analyst

Sure you can buy auto-classification that might get to 95%, auto-cubing that can build any cube you can imagine, auto-analytics that can run the entire slate of standard analytics and compute past, current, and projected costs against past current, and projected market data based upon current buying patterns and suggest items, categories, and/or suppliers to (re) source, switch from/to, and possibly (re)shape demand.

But this doesn’t mean that it’s the right items or categories to chase, the right suppliers to use, or even the right area to focus your efforts. It’s based on math, and an assumption of consistent, stable, market conditions, but those don’t exist anymore. If you’re not also considering geo-politics, natural disaster risk, uncertain logistics when the panama canal reaches historic lows for much of the year, terrorists block the Red Sea, and unpredictable weather make sailing around the capes more dangerous than other, and sourcing for resiliency and not just cost, your “spend” analytics are useless. You need an analyst with a good understanding of economics (and access to an economist), geo politics (and access to local experts), and resiliency, not just total cost of ownership buying. (Now, the junior data pushers are probably all dead and gone, but not the real experts!)

Claim #3: Sourcing Event Manager

Now, transactional buyers are gonna get replaced by autonomous systems that use next generation (advanced) robotic process automation enhances with machine learning in Agentic systems, because ordering off of contracts, ordering from catalogs, and doing low-cost non-strategic buys through quick-quote RFPs doesn’t take any brainpower whatsoever (making it perfect for AI that has none).

But strategic sourcing requires more than just buying off of contracts, ordering from catalogs, and issuing quick-quote RFPs! It requires defining key criteria (that go beyond what engineering, marketing, or maintenance provides), identifying validated suppliers (or identifying suppliers that can be easily validated), holistically analyzing the market conditions, determining the best event type, determining the negotiation strategy, etc. The tools might be able to help with initial supplier identification, collecting numerical (commodity) market data, letting you know what event types were run in the past, compiling fact-based playbooks, and, of course, automating each extent of the process, but they can’t do real strategic sourcing that requires real human intelligence. And with today’s geo-political uncertainty, that human intelligence is needed more than ever which means that expert sourcing professionals are needed more than ever. (But dumb buyers will join the dodos.)

There are more ridiculous claims, but you get the point. Skilled jobs are not going away. (But bit pushers are.)

14. New standards for Ethical and Sustainable Supply Chains.

In some countries, current standards aren’t even being met. Good luck getting new standards introduced, since there aren’t a lot of global internationals (with those headquartered in the US in particular) that want even more rigour, especially if it will cost money! As long as laws are being minimally met, or reasonably-sized “facilitation payments” can make problems go away, this is not a priority, especially if going beyond would cost more money!

15. The “AI Singularity” is coming faster than we can process.

It’s not, because the models can’t get bigger, there is no more data, and no one has yet come up with a model that has any hope of even getting close to the actual intelligence of a pond snail.

Plus, if it ever did happen, considering a “singularity” is actually a black hole, it would rapidly consume (i.e. destroy) the Earth, and we wouldn’t have to worry about it. This is just more nonsense from the A.S.S.H.O.L.E.