Is using an LLM as the reasoning engine a good idea? What is the multi-agent concept? How does Waylay use subflows to control what an LLM can and cannot do?

In this video, we explore these questions in depth.

Navigating the World of Multi-Agent Systems with LLM

Until now, we have tackled digitization challenges by converting everything into APIs, enabling every capability to be used by RPA tools to replicate human tasks, integrate information, and determine necessary actions. An important question arises: will large language models be sophisticated enough to handle API orchestration independently, serving as the reasoning engine? For this to be feasible, we must not only transform every API into an agent (providing the LLM with capabilities) but also evaluate whether LLMs alone can effectively manage actions and decisions.

To illustrate the challenge, and how Waylay addresses the shortcomings of an LLM as a reasoning engine, we prepared a simple video that walks through each of these items separately.

Turning APIs into LLM Agents

In our framework, every API can be transformed into an agent in a matter of minutes. For instance, consider APIs for rolling dice or creating alarms. By defining these APIs as agents, we enable the LLM to manage and interact with them seamlessly. The process involves wrapping the API functionalities into agents, providing the LLM with descriptions of their functions and required inputs.
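To make this concrete, here is a minimal sketch of what wrapping APIs as agents can look like. The agent names (`roll_dice`, `create_alarm`), the schema shape, and the dispatcher are illustrative assumptions in the style of common LLM function-calling conventions, not Waylay's actual framework.

```python
import random

# Hypothetical agents standing in for real APIs: one rolls a die,
# the other creates an alarm.
def roll_dice(sides: int = 6) -> int:
    """Roll a single die and return the result."""
    return random.randint(1, sides)

def create_alarm(message: str, severity: str = "minor") -> dict:
    """Create an alarm with a message and a severity level."""
    return {"message": message, "severity": severity, "status": "created"}

# Each agent pairs the callable with the description and parameter
# spec the LLM uses to decide when to call it and what inputs it needs.
AGENTS = {
    "roll_dice": {
        "fn": roll_dice,
        "description": "Rolls a die. Input: sides (integer, default 6).",
        "parameters": {"sides": {"type": "integer", "required": False}},
    },
    "create_alarm": {
        "fn": create_alarm,
        "description": "Creates an alarm. Inputs: message (string), severity (string).",
        "parameters": {
            "message": {"type": "string", "required": True},
            "severity": {"type": "string", "required": False},
        },
    },
}

def dispatch(agent_name: str, arguments: dict):
    """Invoke an agent by name, as an LLM tool-call handler would."""
    return AGENTS[agent_name]["fn"](**arguments)
```

With the registry in place, a tool call emitted by the LLM reduces to a name plus an argument dictionary, e.g. `dispatch("create_alarm", {"message": "guess was correct"})`.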

We kick off with a simple yet illustrative example: a multi-agent game involving dice rolls. The goal is straightforward—roll the dice a few times and, if a guess is correct, trigger an alarm. This example serves as a foundation for understanding how LLMs can be employed to interact with various agents and APIs, effectively turning them into functional components of a broader system.

Testing the Setup

To demonstrate this, we ran a practical test in which the LLM was tasked with rolling the dice three times and creating an alarm based on the outcome. Do we trust the LLM to do the right thing? Probably yes; this does not sound like too hard a problem for an LLM to crack. Still, we need to watch out for the assumptions the LLM makes when providing inputs to different agents: if those inputs are not clearly specified, the LLM fills the gaps with defaults that may or may not be desired.

Managing Complex Scenarios

We then tackled a more complex use case: finding a car rental service when a vehicle breaks down. This scenario involved coordinating multiple agents: a contract agent, a rental agent, and a location agent. The LLM's role was to mediate between these agents to provide a solution. Since this scenario requires not only a simple API search but also contract verification and day-rate allowance checks to find the appropriate car, can we fully trust the LLM, knowing it sometimes struggles with basic math problems? Probably not. However, can we mitigate these issues and still leverage the LLM for its strengths: calling the right agents at the right time and, through its reasoning engine, providing a simple interface (bot, text, or voice) that guides a user to the correct solution?
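The day-rate allowance check is exactly the kind of arithmetic that should live in plain code rather than in the LLM's head. A minimal sketch, with illustrative contract fields and offer shapes (not Waylay's actual data model):

```python
def within_allowance(day_rate: float, contract_daily_limit: float,
                     rental_days: int, remaining_budget: float) -> bool:
    """Deterministically verify a rental offer against the contract terms."""
    if day_rate > contract_daily_limit:
        return False
    return day_rate * rental_days <= remaining_budget

def filter_offers(offers: list, contract_daily_limit: float,
                  rental_days: int, remaining_budget: float) -> list:
    """Return only the rental offers the contract actually allows."""
    return [o for o in offers
            if within_allowance(o["day_rate"], contract_daily_limit,
                                rental_days, remaining_budget)]
```

The LLM still decides when to ask for offers and how to present the result, but the pass/fail verdict comes from deterministic code, so a hallucinated calculation can never approve an out-of-contract rental.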

Implementing Subflows for Better Control

This example highlighted the necessity of encapsulating the complex parts of the problem as sub-flows, where we can control the reasoning process with absolute certainty, avoiding potential issues such as incorrect calculations or “hallucinations” by the LLM.

For instance, a location service was used to find the nearest rental service based on a user’s phone number and the exact car location, while a sub-flow efficiently managed complex data interactions. Once the location and the name of the person in need were known, the remaining decisions were handled by a sub-flow presented to the LLM as another agent. Unlike the initial location agent, this sub-flow acted as a "super agent," a composable API flow with logic that the LLM recognized as an additional capability.
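The "super agent" idea can be sketched as follows: several steps and their decision logic are composed into one callable, and only that callable is registered as an agent, so from the LLM's perspective the whole sub-flow is a single capability. The service stubs and the day-rate cap below are hypothetical placeholders for the real location and contract services.

```python
def find_nearest_rental(location: str) -> dict:
    """Stub for a location-service lookup of the nearest rental office."""
    return {"name": "CityRent", "location": location, "day_rate": 55.0}

def check_contract(customer: str, day_rate: float) -> bool:
    """Stub for deterministic contract verification (illustrative 80.0 cap)."""
    return day_rate <= 80.0

def rental_subflow(customer: str, location: str) -> dict:
    """One agent call from the LLM's perspective; fixed logic inside.

    The LLM supplies customer and location; the lookup, the contract
    check, and the approve/reject decision happen in code it cannot alter.
    """
    offer = find_nearest_rental(location)
    if not check_contract(customer, offer["day_rate"]):
        return {"status": "rejected", "reason": "exceeds contract day rate"}
    return {"status": "approved", "offer": offer}
```

Because the branching lives inside `rental_subflow`, the LLM can only trigger the flow and relay its outcome; it never gets the chance to reason its own way to a different decision.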

Transform Your Decision-Making with Advanced AI: Integrate LLMs for Enhanced Automation Today!

Are you ready to revolutionize your decision-making process with cutting-edge AI?

Contact us today to integrate LLMs into your workflow, enabling effective use of generative AI in automation.

Let us help you streamline operations, enhance accuracy, and unlock new capabilities with advanced AI solutions. Reach out now to explore how we can transform your business!