Document Object Model

12h

Model helps predict hip fractures among women with osteoporosis by analyzing only 7% of the joint

Scientists at Pompeu Fabra University (UPF) have made a great leap forward in predicting the risk of hip fracture among women ...

Mistral launches OCR 3 to digitize enterprise documents, touts 74% win rate and $2-per-1,000-page pricing

Mistral AI launches OCR 3 at $2 per 1,000 pages, arguing that document digitization — not chatbots — is the critical first ...

Microsoft

DocReward: A Document Reward Model for Structuring and Stylizing

Recent advances in agentic workflows have enabled the automation of tasks such as professional document generation. However, they primarily focus on textual quality, neglecting visual structure and ...

GitHub

DifFUSER: Diffusion Model for Robust Multi-Sensor Fusion in 3D Object Detection and BEV Segmentation

[2024/12] Code release: Inferece, Diffusion sampling, Pretrained model. [2024/10] DifFUSER is presented at ECCV 2024. [2024/07] DifFUSER is accepted by ECCV 2024. This repository contains the official ...

GitHub

Moshi: a speech-text foundation model for real time dialogue

Finally, the code for the web UI client used in the Moshi demo is provided in the client/ directory. If you want to fine tune Moshi, head out to kyutai-labs/moshi ...

IEEE

Fast and Accurate 6-D Object Pose Refinement via Implicit Surface Optimization

Abstract: Aligning a point cloud to a fixed 3-D model is a crucial task in many applications, such as 6-D pose estimation for robotic grasping. Typically, an initial pose is estimated by analyzing ...

IEEE

TrackingMamba: Visual State Space Model for Object Tracking

Abstract: In recent years, UAV object tracking has provided technical support across various fields. Most existing work relies on convolutional neural networks (CNNs) or visual transformers. However, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results