Q*: Boosting LLM Reasoning with Planning

You give Large Language Models (LLMs) an inch, and they take a mile. Well, maybe not a mile, but they do tend to wander off the beaten path, especially during multi-step reasoning. These AI marvels, while great at mimicking human language, often struggle to stay on track when a task requires multiple steps. That’s because their autoregressive nature (predicting the next token based only on what came before) can lead them down rabbit holes of errors, hallucinations (like those AI-generated images of cats with seven legs), and contradictions.

The Q* Solution

The challenge we face is how to keep these LLMs on the straight and narrow. A team of researchers may have a solution with Q*. Imagine a digital shepherd gently guiding a flock of sheep (those wandering LLMs) through a field of words. That’s essentially what Q* does. It acts as a heuristic function, a fancy term for a “rule of thumb,” that estimates which next step is most likely to lead the LLM to the correct answer.

How Q* Works

  • Deliberative Planning: Instead of letting the LLM simply blurt out the first thing that comes to mind, Q* encourages it to plan ahead, like a chess player contemplating their next move.
  • Q-Value Model: This is the heart of Q*. It acts as a scorecard, rating each potential “next step” the LLM could take. The higher the score, the more promising the path.
  • Plug-and-Play: The beauty of Q* is that it’s designed to be versatile. You can apply it to different LLMs and various reasoning tasks without retraining the entire model. Think of it as a universal remote for LLMs.
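The three ideas above combine into a best-first search over partial reasoning traces: keep a frontier of candidate traces, and always expand the one the Q-value model scores highest. Here is a minimal sketch, where `propose_steps` (the LLM's step generator) and `q_value` (the learned scorecard) are hypothetical stand-ins for the paper's actual components:

```python
import heapq

def q_star_search(question, propose_steps, q_value, max_depth=8, beam_width=3):
    """Best-first search over partial reasoning traces.

    propose_steps(question, trace) -> candidate next steps (strings)
    q_value(question, trace)       -> estimated value of the trace (higher = better)
    Both stand in for LLM calls in the real method.
    """
    # heapq is a min-heap, so we store negated scores to pop the best trace first.
    frontier = [(-q_value(question, []), [])]
    best_trace, best_score = [], float("-inf")
    while frontier:
        neg_score, trace = heapq.heappop(frontier)
        score = -neg_score
        if score > best_score:
            best_score, best_trace = score, trace
        if len(trace) >= max_depth:
            continue  # trace is complete; don't expand further
        # Expand the most promising trace with a few candidate next steps.
        for step in propose_steps(question, trace)[:beam_width]:
            new_trace = trace + [step]
            heapq.heappush(frontier, (-q_value(question, new_trace), new_trace))
    return best_trace
```

The "plug-and-play" property falls out of the interface: swapping in a different LLM only changes `propose_steps`, and the frozen base model is never retrained.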

Promising Results and a Look Ahead

Early tests of Q* on benchmark datasets like GSM8K (for grade-school math problems) and MATH have shown promising improvements in accuracy. It’s like giving those LLMs a much-needed dose of focus and discipline.

Of course, this is just the beginning. The field of AI is constantly evolving, and there’s still much to explore. But with innovative approaches like Q*, we’re getting closer to harnessing the full potential of LLMs for complex reasoning tasks.

Fun Fact: The name “Q*” is a nod to Q-learning, a reinforcement learning technique where an agent learns to make decisions by maximizing rewards. In this case, the “reward” is guiding the LLM towards more accurate and coherent reasoning.
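For readers new to Q-learning, the core idea is a table of values Q(s, a) nudged toward the observed reward plus the discounted value of the best next action. A toy tabular sketch (the `step` environment and all parameters here are invented for illustration, not from the paper):

```python
import random

def q_learning(states, actions, step, alpha=0.5, gamma=0.9, episodes=200):
    """Tabular Q-learning.

    step(s, a) -> (next_state, reward, done); stands in for the environment.
    """
    Q = {(s, a): 0.0 for s in states for a in actions}
    for _ in range(episodes):
        s, done = states[0], False
        while not done:
            # Epsilon-greedy: mostly exploit the best-known action, sometimes explore.
            if random.random() < 0.2:
                a = random.choice(actions)
            else:
                a = max(actions, key=lambda x: Q[(s, x)])
            s2, r, done = step(s, a)
            # Bootstrapped update toward reward + discounted best next value.
            target = r if done else r + gamma * max(Q[(s2, x)] for x in actions)
            Q[(s, a)] += alpha * (target - Q[(s, a)])
            s = s2
    return Q
```

In Q*, the analogous Q-value model scores (partial reasoning trace, next step) pairs instead of (state, action) pairs in a game.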

This new approach, while still in its early stages, has the potential to make LLMs even more powerful and reliable tools for tackling intricate problems. It’s an exciting time to be watching this field unfold!
