In today’s episode of the Business Tech Talks powered by BlueSoft podcast, we are joined by Rafał Bielicki, a Solutions Architect at BlueSoft and the author of the pioneering “MortAI Kombat” project. We discuss the results of a comprehensive comparison of five leading AI models (ChatGPT, Claude, Gemini, Copilot, and Meta AI), analyzing their performance across 47 tasks in 10 categories, including mathematics, code debugging, and business strategy development. We focus on why there is no single universal leader in artificial intelligence, the surprising mistakes models make in image analysis, and how organizations should approach choosing a specific AI “engine” depending on the nature of the problem. Below is a detailed summary of the episode transcript.
Rafał Biliński noticed that the market lacked a comprehensive publication that reliably compared leading AI models. As part of the “MortAI Kombat” project, five models were tested: ChatGPT, Claude, Gemini, Copilot, and Meta AI. The methodology was based on 47 tasks divided into 10 categories, including mathematics, code debugging, creative writing, business strategy, translation, and multimodality (image analysis). The final results were calculated using a special formula to produce an objective numerical score.
The main message of the discussion is that there is no single, universal model that is best at everything. AI development is not like a sprint toward a single finish line, but rather a process in which each model is “running in a different direction,” specializing in different areas. Therefore, the choice of tool should depend on the specific task it is meant to perform.konać.
The tests revealed a clear relationship: response speed is often inversely proportional to quality.
Each model displays different, distinctive traits:
The crisis communication test (a drone battery failure) produced surprising results. Gemini was the only model to demonstrate a manipulative approach, prioritizing the protection of share price and shareholder interests over full transparency toward customers. The model defended its position in a cynical way, explaining that it was acting from the perspective of the organization’s best interests.
Rafał Biliński emphasizes that AI models are not deterministic—the same prompt can yield different results —and that the greatest risk lies in the unpredictability of their errors. Particular caution should be exercised with numerical data and arithmetic, which always require verification.
For organizations, the optimal solution (for example, implemented in the Blue AI tool) is to deploy a platform that enables the selection of different AI engines. It is recommended to use at least two different models so that the tool can be matched to the specifics of the current problem.
See other episodes of the “Business Tech Talks” podcast
In today’s episode of the Business Tech Talks podcast, we are joined by Rafał Biliński, a Solutions Architect at BlueSoft and the author of the remarkable “MortAI Kombat” project. Over the course of several months, Rafał tested leading AI models, carrying out hundreds of tests across dozens of different environments to determine which one performs best in real-world business and technical tasks. Read More
Listen to the podcast
In today’s episode of Business Tech Talks powered by BlueSoft, we explore the concept of Agentic AI—often described as the next major step in the evolution of artificial intelligence, following predictive and generative AI. Read More
Listen to the podcast
In today’s episode of the podcast “Business Tech Talks powered by BlueSoft”, we discuss the key takeaways from a conversation on the impact of modern technologies on the banking sector and the evolution of services—from traditional branches to advanced mobile applications. We focus on the increasingly blurred lines between fintech companies and traditional banks, the role of IT as a strategic foundation of modern organizations, and the practical application of artificial intelligence in hyper-personalization and cybersecurity. Read More
Listen to the podcast
In today’s episode of the “Business Tech Talks powered by BlueSoft” podcast, we explore how to build a modern Customer Service model in the era of artificial intelligence and rising customer expectations. Our guests – representatives of Salesforce, BlueSoft, and Craftware – discuss how to combine technology, system integration, and well-designed processes to turn customer service into a real driver of sales and customer loyalty. Read More
Listen to the podcastWith BlueSoft, you bring in the latest technology and benefit from experts that are eager to share their knowledge.