One Different Image Layout Estimation and Drawing 3D Layout

Hi, I really appreciate the project and hope it can be developed more :)

Now, I'm trying to do only layout_estimation. My purpose is to give an image and take its layout 3D image.
Like this : 
![image](https://user-images.githubusercontent.com/53530231/124168296-9e99a980-daad-11eb-9009-fd058bb12837.png)
![image](https://user-images.githubusercontent.com/53530231/124168309-a35e5d80-daad-11eb-86be-ae193dc488fd.png)

First problem is that how can cam_K be estimated ? I have check out all code samples. I can could `layout` and `cam_R` estimation. In all your samples you use cam_K of data to draw 3D layout. How can I predict it or is there any way to draw 3D without cam_K.

Second problem is that I don't know I am doing correctly but when I tried to estimate layouts of demo datas, my results were really bad. I used demo.py steps to predict layout points.
For weight, I used your pretained_model firstly, then I trained 100 epochs and tried its weight. But the results was same.

I used here @chengzhag's `layout_estimation.yaml `
```
def estimate(img_path):
    cfg = CONFIG("configs/layout_estimation.yaml",)
    checkpoint = CheckpointIO(cfg)
    cfg = mount_external_config(cfg)
    device = load_device(cfg)
    cfg.config["mode"] = "demo"
    net = load_model(cfg, device=device)
    checkpoint.register_modules(net=net)

    cfg.config['demo_path'] = img_path
    data = load_demo_data(cfg.config['demo_path'], device)

    with torch.no_grad():
        est_data = net(data)
    
    

    lo_bdb3D_out = get_layout_bdb_sunrgbd(cfg.bins_tensor, est_data['lo_ori_reg_result'],
                                          torch.argmax(est_data['lo_ori_cls_result'], 1),
                                          est_data['lo_centroid_result'],
                                          est_data['lo_coeffs_result'])
    layout = lo_bdb3D_out[0,:,:].cpu().numpy()
    
    cam_R_out = get_rotation_matix_result(cfg.bins_tensor,
                                          torch.argmax(est_data['pitch_cls_result'], 1), est_data['pitch_reg_result'],
                                          torch.argmax(est_data['roll_cls_result'], 1), est_data['roll_reg_result'])
    pre_cam_R = cam_R_out[0, :, :].cpu().numpy()

    pre_layout = format_layout(layout)
    
    return pre_layout, pre_cam_R
```
To draw 3D layout : 
(I'm getting cam_K of the sample. Not shown here)

```
img_path = "./demo/inputs/1"
sequence_id = img_path[-1]   
    
rgb_image = np.asarray(Image.open(img_path+"/img.jpg").convert('RGB'))
pre_layout , pre_cam_R = estimate(img_path)
scene_box = Box(rgb_image, None, cam_K, None, pre_cam_R, None,
                pre_layout, None, None, 'prediction', None)

scene_box.draw3D(if_save=True, save_path = './demo/sunrgbd/%s_recon.png' % (sequence_id))
```
I got results like this:
![image](https://user-images.githubusercontent.com/53530231/124171080-c2aaba00-dab0-11eb-82f0-06cedca18e90.png)

It should seem like this : 
![image](https://user-images.githubusercontent.com/53530231/124171381-29c86e80-dab1-11eb-822f-f55b4af6af31.png)

I hope that I could express myself clearly. 
Thank very much^^



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

One Different Image Layout Estimation and Drawing 3D Layout #36

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

One Different Image Layout Estimation and Drawing 3D Layout #36

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions