Document Object Model in XML

DocReward: A Document Reward Model for Structuring and Stylizing

Recent advances in agentic workflows have enabled the automation of tasks such as professional document generation. However, they primarily focus on textual quality, neglecting visual structure and ...

GitHub

facebookresearch/sam-3d-objects

SAM 3D Objects is a foundation model that reconstructs full 3D shape geometry, texture, and layout from a single image, excelling in real-world scenarios with occlusion and clutter by using ...

GitHub

DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation

[2024/12] Code release: Inferece, Diffusion sampling, Pretrained model. [2024/10] DifFUSER is presented at ECCV 2024. [2024/07] DifFUSER is accepted by ECCV 2024. This repository contains the official ...

IEEE

TrackingMamba: Visual State Space Model for Object Tracking

Abstract: In recent years, UAV object tracking has provided technical support across various fields. Most existing work relies on convolutional neural networks (CNNs) or visual transformers. However, ...

IEEE

Exploring Vision-Language Foundation Model for Novel Object Captioning

Abstract: It is always well believed that pre-trained vision-language foundation models (e.g., CLIP) would substantially facilitate vision-language tasks. Nevertheless, there has been less evidence in ...

Federal Register

Notice of Determinations; Culturally Significant Objects Being Imported for Exhibition-Determinations: “New Rome: The Art of Byzantium” Exhibition

This site displays a prototype of a “Web 2.0” version of the daily Federal Register. It is not an official legal edition of the Federal Register, and does not replace the official print version or the ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results