The Evie chatbot has had a huge impact on social media over the last few years. She is probably the most popular artificial personality on YouTube. She has appeared in several videos by PewDiePie, the most subscribed YouTuber in the world. This includes a flirting video with over 12 million views! Evie has been filmed speaking many different languages. She chats with Squeezie in French, El Rubius and El Rincón De Giorgio in Spanish, GermanLetsPlay and ConCrafter in German, NDNG - Enes Batur in Turkish, Stuu Games in Polish and jacksepticeye, ComedyShortsGamer and KSIOlajidebtHD in English. And that is a very small selection. Evie shares her database with Cleverbot, which is an internet star in its own right. Cleverbot conversations have long been shared on Twitter, Facebook, websites, forums and bulletin boards. We are currently working to give Evie some more artificial companions, such as the male avatar Boibot.
Sometimes it is hard to tell whether a conversational partner on the other end is a real person or a chatbot. In fact, it is getting harder as technology progresses. A well-known way to measure a chatbot’s intelligence in a more or less objective manner is the so-called Turing Test. This test determines how well a chatbot can pass for a real person by giving responses indistinguishable from a human’s.
The first formal instantiation of a Turing Test for machine intelligence is the Loebner Prize, which has been organized since 1991. In a typical setup, there are three areas: a computer area with typically 3-5 computers, each running a stand-alone version (i.e. not connected to the internet) of a participating chatbot; an area for the human judges, typically four persons; and another area for the ‘confederates’, typically 3-5 human volunteers, depending on the number of chatbot participants. The human judges, each working at their own terminal and separated from one another, engage in a conversation through the terminal without knowing whether they are connected to a computer or a human. Then, they simply start to interact. The organizing committee requires that conversations be restricted to a single topic. The task for the human judges is to recognize chatbot responses and distinguish them from human ones. If the judges cannot reliably distinguish the chatbot from the human, the chatbot is said to have passed the test.
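To make the pass criterion concrete, here is a minimal Python sketch of how verdicts from such a blind judging session could be tallied. It is an illustration only, not the Loebner Prize's actual scoring procedure: the `Judgment` record, the `fooled_rate` and `passes` helpers, the 0.5 threshold and the sample verdicts are all assumptions made up for this example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Judgment:
    """One judge's verdict after one blind conversation (hypothetical record)."""
    judge: str
    partner_is_bot: bool  # ground truth, hidden from the judge
    judged_as_bot: bool   # what the judge decided


def fooled_rate(judgments):
    """Fraction of chatbot conversations that the judge labeled as human."""
    bot_sessions = [j for j in judgments if j.partner_is_bot]
    if not bot_sessions:
        raise ValueError("no conversations with a chatbot were recorded")
    fooled = sum(1 for j in bot_sessions if not j.judged_as_bot)
    return fooled / len(bot_sessions)


def passes(judgments, threshold=0.5):
    """Assumed pass rule: the bot is taken for a human in at least
    `threshold` of its sessions. The threshold is illustrative, not official."""
    return fooled_rate(judgments) >= threshold


# Made-up verdicts: four judges talk to the chatbot, one also to a confederate.
sample = [
    Judgment("judge_1", partner_is_bot=True, judged_as_bot=True),
    Judgment("judge_2", partner_is_bot=True, judged_as_bot=False),
    Judgment("judge_3", partner_is_bot=True, judged_as_bot=True),
    Judgment("judge_4", partner_is_bot=True, judged_as_bot=True),
    Judgment("judge_1", partner_is_bot=False, judged_as_bot=False),
]

print(fooled_rate(sample))  # 0.25: only judge_2 took the bot for a human
print(passes(sample))       # False under the assumed 0.5 threshold
```

Conversations with confederates are ignored when computing the rate, since the question here is only how often the chatbot is mistaken for a person. What counts as "reliably distinguish" is a choice left to the organizers, which is why the threshold is a parameter rather than a fixed value.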
“Beware though, bots have the illusion of simplicity on the front end but there are many hurdles to overcome to create a great experience. So much work to be done. Analytics, flow optimization, keeping up with ever changing platforms that have no standard. For deeper integrations and real commerce like Assist powers, you have error checking, integrations to APIs, routing and escalation to live human support, understanding NLP, no back buttons, no home button, etc etc. We have to unlearn everything we learned the past 20 years to create an amazing experience in this new browser.” — Shane Mac, CEO of Assist