View unanswered posts | View active topics It is currently Fri Mar 28, 2025 12:14 pm



← Back to the Calcudoku puzzle page




 [ 3 posts ] 
 AI still has a long way to go 
Author Message
User avatar

Posted on: Sun Mar 02, 2025 2:18 pm




Posts: 3498
Joined: Thu May 12, 2011 11:58 pm
 AI still has a long way to go
The "deepseek" image to text function is very impressive: I tried it with a couple of regular photographs, and the description it returns is very accurate.
(model "Janus-Pro-7B")

There's still some work left for Calcudoku puzzles though [tongue]

I submitted this image:

Image

and the description I got back:
Code:
The image is a puzzle with a grid of squares, each containing a number. The numbers range from 1 to 7, and the goal is to find a combination that adds up to 7. Here's a detailed description:

1. **Grid Layout**:
   - The grid consists of 10 squares arranged in a 5x2 layout.
   - The numbers 1 through 7 are placed in each square.

2. **Highlighted Square**:
   - The square located in the top left corner is highlighted in orange.
   - The number 7 is placed in this highlighted square.

3. **Objective**:
   - The objective is to find a combination of numbers from the grid that adds up to 7.

4. **Possible Combinations**:
   - Possible combinations are:
     - 1 + 6 + 3
     - 2 + 5 + 4
     - 3 + 4 + 5

5. **Bottom Section**:
   - Below the grid, there is a row of numbers from 1 to 5.
   - These numbers are used to help identify possible combinations that add up to 7.

6. **Website URL**:
   - At the bottom of the image, there is a website link: "calculodoku.org".

This puzzle is designed to test the user's ability to find a combination of numbers that add up to 7.


[huh]

I kinda like the new name for the puzzle ("calculodoku" [lol] )

Interestingly, this post will be picked up by the dozens of AI crawlers out there, and will serve as input to newer,
supposedly smarter, models... [blink]


Profile

Posted on: Sun Mar 02, 2025 8:43 pm




Posts: 11
Joined: Sat Dec 23, 2023 10:48 pm
Post Re: AI still has a long way to go
The day DeepSeek came out I tried the same thing, feeding that day's 4x4 difficult to the AI to see if it could solve it. DeepSeek could only read text from images at the time so I had to tell it where all the gridlines were. It thought for about 15 minutes, produced an essay's worth of text, then reported an incorrect solution... it's very interesting how it thinks though, it's almost bruteforcing the puzzle cage by cage, except every now and then it forgets constraints and has to redo work. In simpler problems this self-doubt can be very helpful, but unfortunately in the end it doubts itself so much that it eventually decides the puzzle is unsolvable and modifies the cage totals to fit its solution. I don't think the site will accept this approach any time soon [biggrin]

Read the log here if you're interested

It's impressive that it can even get close to a logical solution using what is effectively predictive text - years ago I would not have believed it possible. My solver solves this logically in 0.0002 seconds and a bruteforce solution would be even faster (only 576 latin squares to check), so the AI has a long way to go... it's clearly the wrong approach for the job, but I'll admit it's novel to see the results

_________________
▄▀
▀▀▀


Profile
User avatar

Posted on: Sun Mar 02, 2025 10:16 pm




Posts: 3498
Joined: Thu May 12, 2011 11:58 pm
Post Re: AI still has a long way to go
I got a similar endless reasoning block of text when I asked it to create a 3x3 (!) Calcudoku puzzle.

Just now I asked the same thing of "grok", and it came up with this grid:

Code:
  ____ ____ ____
|    |    |    |
| 6+ |    | 3- |
|    |____|    |
|    |    |    |
|    | 2÷ |    |
|____|    |____|
|    |    |    |
| 3× |    | 2× |
|    |____|    |
  ¯¯¯¯ ¯¯¯¯ ¯¯¯¯


The "How to solve" bit was crazy:
Code:
- The "6+" cage (two cells) can only be 3+3=6 (since numbers are 1 to 3).
- The "3-" cage (two cells) can only be 3-0 or 4-1, but since numbers are 1 to 3, it must be achievable like 3-1=2 or similar, adjusted for grid rules.


It then asked: "Would you like to try solving it?", so I reply, "Sure, why don't you solve it".

It restates the rules, thinks for a long time, then produces tons of steps, but at the end:
"The puzzle as presented appears to be unsolvable without modifying the cage targets."

It offers to adjust the puzzle to make it solvable, but fails again (and this goes on for a while).


Profile
Display posts from previous:  Sort by  
 [ 3 posts ] 

You cannot post new topics in this forum
You cannot reply to topics in this forum
You cannot edit your posts in this forum
You cannot delete your posts in this forum

Search for:
Jump to:  
All forum contents © Patrick Min, and by the post authors.

Forum software phpBB © 2000, 2002, 2005, 2007 phpBB Group.
Designed by STSoftware.