Homechevron_rightNewschevron_rightNews Releasechevron_rightAmiVoice® Communication Suite, an AI speech recognition solution for contact centers, now features a new function, "AI Multi-Stage Inference," that utilizes multiple generative AIs to enhance processing.

AI voice recognition solution for contact centersAmiVoice® Communication Suite" has implemented a new function, "AI Multi-Stage Inference," which utilizes multiple generative AIs to enhance processing.

Combining the optimal AI for each purpose to improve accuracy, efficiency, and cost optimization

 Advanced Media Inc. (Headquarters: Toshima-ku, Tokyo; Chairman and CEO: Suzuki Kiyoyuki; hereinafter referred to as "Advanced Media") has implemented an "AI multi-stage inference" function in its "AmiVoice Communication Suite," a contact center solution equipped with the AI ​​voice recognition AmiVoice. This function improves accuracy, efficiency, and cost optimization by using multiple generation AIs depending on the call content, purpose, and use, and by dividing up processing in stages.

Examples of using "AI multi-stage inference"

 "AmiVoice Communication Suite" has the No. 1 domestic market share*It is an AI speech recognition solution for contact centers. In August 2024, we began offering "AOI LLM for AmiVoice Communication Suite (AOI LLM)," an optional generation AI that can be used securely in a local environment, and in February 2025, we released an external generation AI integration function.

 This function was developed in response to the needs of companies using "AmiVoice Communication Suite" and "AOI LLM" who wanted to "introduce a system that automatically switches to the most appropriate prompt depending on the content of the call." In calls at contact centers, the necessary summarization and information extraction points vary depending on the inquiry content and purpose, such as "I would like to change the delivery date" or "The staff member's response was poor." Therefore, it was difficult to respond adequately with processing using a single conventional generation AI, and there were issues with accuracy and operational efficiency.

 In this context, AmiVoice Communication Suite has implemented a new function called "AI Multi-Stage Inference," which enables more accurate and flexible processing by combining and gradually utilizing multiple generation AIs according to the content, purpose, and use of the call.
 The type of processing to be performed at each stage of "AI multi-stage inference" can be flexibly configured for each customer. By selecting and switching between the optimal generation AI and prompt depending on the purpose and application, the output quality of the generation AI can be improved, enabling more advanced and flexible processing. This function makes it possible to organize complex topics and process multifaceted information, which were difficult to handle with conventional processing using a single generation AI. In addition, by switching between low-cost and high-performance generation AI depending on the difficulty of the task, it also contributes to cost optimization.
This function also allows you to set the AI ​​and prompts to be used according to keywords included in the call. Furthermore, if you are using CTI integration, you can automatically change the AI ​​and prompts to be used according to the skills defined in the system.

■Examples of using "AI multi-stage inference"
① Classify the inquiry content and summarize it according to the call reason
The first-stage generation AI analyzes the content of the call and categorizes it into categories such as "address change," "lost card," and "confirmation of usage details." Based on the classification results, the second-stage generation AI performs summarization using prompts optimized for each category. By automating summarization according to the call reason, it significantly reduces the workload of operators and contributes to shortening post-processing time.

② Utilizing AI generation based on call attribute information through CTI integration
For inbound calls, the generation AI summarizes and analyzes the content of the call, and for outbound calls, it proposes scripts, making it possible to utilize AI according to the attributes of the call through CTI integration.In addition, by setting the processing content of the generation AI for each skill defined in the system, such as "updating contract information" and "handling cancellations," more flexible and accurate support can be provided, achieving high-quality responses that are not dependent on operator skill.

3) Combining local and cloud-based generation AI to achieve secure operation
In the first stage, AOI LLM, a secure generation AI that can be used in a local environment, is used to mask personal information such as names, addresses, and phone numbers. Next, the masked text data is passed to a cloud-based generation AI (such as ChatGPT), which performs advanced processing such as summarization, classification, and analysis. By passing only data that does not contain personal information to the cloud-based generation AI, it can be used safely even in industries with strict security policies.

④ Utilizing multiple generation AI to suppress hallucination
Combining different generation AIs can also be expected to suppress hallucinations that are dependent on specific models. For example, by re-presenting the summary results output by the first-stage generation AI along with the call data to the second-stage generation AI, the consistency and validity of the content can be verified and supplemented. By utilizing multiple generation AIs, it is possible to prevent the generation of incorrect information and obtain more reliable output.

"AIFeatures of "Multi-stage Inference"

1. Combining multiple generative AIs for different purposes to improve accuracy and efficiency
By combining multiple generation AIs according to the content of the call, purpose, and application, more accurate and flexible processing can be achieved, helping to improve the accuracy of output results from generation AI and significantly improve work efficiency. The generation AI, prompts, and processing content to be used can be configured for each customer.

2. Optimizing costs by selecting generation AI according to task difficulty
By using either low-cost or high-performance generation AI depending on the difficulty of the processing to be performed by the generation AI, operational costs can be optimized, achieving both operational efficiency and cost performance.

3. Versatility that allows flexible use in a variety of tasks
It can be used for a variety of purposes in contact centers, including classifying and analyzing call content, summarizing, extracting information, organizing text, masking personal information, and hallucination checks.It can be used in a wide range of industries, from manufacturer customer service desks to insurance and financial contact centers that handle complex procedures.

 Advanced Media will continue to expand the possibilities of business support using AI speech recognition AmiVoice and generative AI, aiming to further improve the efficiency and quality of contact center operations.

"AmiVoice Communication Suite"about

"AmiVoice Communication Suite" is equipped with AI voice recognition AmiVoice and has the No. 1 domestic market share*This is a solution for contact centers. In addition to converting all call content into text, it has many functions such as emotion analysis, topic extraction, simultaneous monitoring of multiple calls by administrators, and operator support, helping to visualize call content and improve response quality.
Cloud version/on-premises version, real-time recognition processing/batch recognition processing are available, allowing flexible operation regardless of the number of seats or scale of the contact center.
https://www.advanced-media.co.jp/lp/communication-suite/

"AOI LLM for AmiVoice Communication Suite"about

"AOI LLM for AmiVoice Communication Suite" is a generative AI service that can generate and summarize call content, extract Q&A, and extract VoC (voice of the customer) in the customer's local environment without releasing any call data, including confidential information, to the outside world. It is optimized for contact center operations through customization and system integration using knowledge and specialized technology based on a track record of implementation in contact centers in a wide range of industries. It can be used without restrictions such as the number of times it can be used, as with cloud-based generative AI services. "AOI LLM for AmiVoice Communication Suite" is provided as an option for "AmiVoice Communication Suite," a voice recognition solution for contact centers.
https://www.advanced-media.co.jp/products/service/aoi-llm

* Source: ITR "ITR Market View: Image and Voice Recognition Market 2024" Voice recognition market - Sales share by vendor for contact center operations (FY2024 forecast)


Above

Inquiries regarding this matter

Management Promotion Headquarters Public Relations Group
Inquiries regarding the contents of this report

CTI Division
Inquiries about this product

Japan's No.1 in Market ShareJapan's No.1 in Market ShareAmiVoiceⓇAmiVoiceⓇ

*Source: ecarlate LLC "Voice Recognition Market Trends 2025"
Speech recognition software/cloud service market

Write with your voice, move with your voice.
AI voice recognition AmiVoice
In various business situations,
This is a technology that enables natural communication between people and machines.