Output format problem with RoboBrain2.0-32B #12

Hi, first of all, thanks for open-sourcing such great work.

I ran into a format problem when using your original format rule to extract the "think" and "answer" parts from the output of the 32B model with "thinking" mode enabled. Here are the cases:

Case 1:
```bash
##### RAW_OUTPUT #####
based on the consistent visual evidence of the window and fridge being part of the scene from this angle, the answer is:
<answer>B. Window and fridge</answer></think><answer><think></think><answer>B. Window and fridge</answer></answer>
##### ANSWER #####
<think>
#################
```


Case 2:
```bash
##### RAW_OUTPUT #####
Considering the spatial layout and the task goal, the most logical next step is to initiate a left turn to reposition myself appropriately for exiting the bedroom and proceeding towards the hallway.</think><answer><think></think><answer>left 90</answer></answer>
##### ANSWER #####
<think>
#################
```


Case 3: 
```bash
##### RAW_OUTPUT #####
Yes</think>
##### ANSWER #####
(empty)
#################
```
As you can see, the raw outputs in all three cases are not in the expected format, which leads to wrong extracted answers (I use the official method from [inference.py](https://github.com/FlagOpen/RoboBrain2.0/blob/main/inference.py) to extract the "answer" and "thinking" parts). I did not see this problem with the 7B version.

Is there a solution for this problem, or am I doing something wrong?
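
As a possible stopgap, a more tolerant parser could look like the minimal sketch below. This is not the logic from `inference.py`: the function name `extract_think_answer` and the fallback rules are assumptions made for illustration. It takes the text before the first `</think>` as the thinking part, picks the last non-empty innermost `<answer>...</answer>` block as the answer, and, if no such block exists, falls back to the text after `</think>` or, failing that, to the thinking text itself.

```python
import re
from typing import Tuple

# Matches any opening or closing think/answer tag.
TAG = re.compile(r"</?(?:think|answer)>")

# Innermost <answer>...</answer>: the body may not contain another
# think/answer tag, so nested wrappers such as
# <answer><think></think><answer>X</answer></answer> yield only "X".
INNER_ANSWER = re.compile(
    r"<answer>((?:(?!</?(?:think|answer)>).)*)</answer>", re.DOTALL
)


def extract_think_answer(raw: str) -> Tuple[str, str]:
    """Hypothetical tolerant parser; returns (thinking, answer)."""
    head, sep, tail = raw.partition("</think>")
    if not sep:
        # No closing think tag: treat the whole output as the answer region.
        head, tail = "", raw

    # Thinking: text before the first </think>, with any answer blocks
    # and stray tags removed.
    thinking = TAG.sub("", INNER_ANSWER.sub("", head)).strip()

    # Answer: last non-empty innermost <answer> block anywhere in the output.
    candidates = [m.strip() for m in INNER_ANSWER.findall(raw) if m.strip()]
    if candidates:
        answer = candidates[-1]
    else:
        # Assumed fallback: untagged text after </think>, else the thinking text.
        answer = TAG.sub("", tail).strip() or thinking
    return thinking, answer


if __name__ == "__main__":
    case2 = ("...proceeding towards the hallway.</think>"
             "<answer><think></think><answer>left 90</answer></answer>")
    print(extract_think_answer(case2))
    print(extract_think_answer("Yes</think>"))
```

With this sketch, Case 1 and Case 2 resolve to `B. Window and fridge` and `left 90`. For Case 3, whether `Yes` should count as the answer is a guess, since the model emitted no `<answer>` tag at all.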


