Notice
Recent Posts
Recent Comments
Link
일 | 월 | 화 | 수 | 목 | 금 | 토 |
---|---|---|---|---|---|---|
1 | 2 | 3 | 4 | 5 | 6 | |
7 | 8 | 9 | 10 | 11 | 12 | 13 |
14 | 15 | 16 | 17 | 18 | 19 | 20 |
21 | 22 | 23 | 24 | 25 | 26 | 27 |
28 | 29 | 30 |
Tags
- 백준
- hm3dsem
- YoLO
- eecs 498
- machine learning
- LSTM
- real-time object detection
- AlexNet
- DP
- hm3d
- CNN
- Python
- C++
- two-stage detector
- 머신러닝
- dynamic programming
- dfs
- deep learning
- 그래프 이론
- ubuntu
- MySQL
- image processing
- 딥러닝
- r-cnn
- Mask Processing
- Reinforcement Learning
- 강화학습
- BFS
- NLP
- opencv
Archives
- Today
- Total
목록3dmv-vqa (1)
JINWOOJUNG

Paperhttps://arxiv.org/abs/2303.11327 3D Concept Learning and Reasoning from Multi-View ImagesHumans are able to accurately reason in 3D by gathering multi-view observations of the surrounding world. Inspired by this insight, we introduce a new large-scale benchmark for 3D multi-view visual question answering (3DMV-VQA). This dataset is collected barxiv.org IntroductionVisual Reasoning은 시각적 장면에 ..
NLP, LLM, Multi-modal
2025. 6. 26. 00:59