|
1 | 1 | # Daily fetch of the latest YOLO-related papers from arXiv
|
2 | 2 |
|
3 | 3 |
|
| 4 | +## BFA\-YOLO: Balanced multiscale object detection network for multi\-view building facade attachments detection |
| 5 | + |
| 6 | +**Publication date**: 2024-09-06 |
| 7 | + |
| 8 | +**Author**: Yangguang Chen |
| 9 | + |
| 10 | +**Abstract**: Detection of building facade attachments such as doors, windows, balconies, |
| 11 | +air conditioner units, billboards, and glass curtain walls plays a pivotal role |
| 12 | +in numerous applications. Building facade attachments detection aids in |
| 13 | +building information modeling \(BIM\) construction and meeting Level of Detail 3 |
| 14 | +\(LOD3\) standards. Yet, it faces challenges like uneven object distribution, |
| 15 | +small object detection difficulty, and background interference. To counter |
| 16 | +these, we propose BFA\-YOLO, a model for detecting facade attachments in |
| 17 | +multi\-view images. BFA\-YOLO incorporates three novel innovations: the Feature |
| 18 | +Balanced Spindle Module \(FBSM\) for addressing uneven distribution, the Target |
| 19 | +Dynamic Alignment Task Detection Head \(TDATH\) aimed at improving small object |
| 20 | +detection, and the Position Memory Enhanced Self\-Attention Mechanism \(PMESA\) to |
| 21 | +combat background interference, with each component specifically designed to |
| 22 | +solve its corresponding challenge. Detection efficacy of deep network models |
| 23 | +deeply depends on the dataset's characteristics. Existing open source datasets |
| 24 | +related to building facades are limited by their single perspective, small |
| 25 | +image pool, and incomplete category coverage. We propose a novel method for |
| 26 | +building facade attachments detection dataset construction and construct the |
| 27 | +BFA\-3D dataset for facade attachments detection. The BFA\-3D dataset features |
| 28 | +multi\-view, accurate labels, diverse categories, and detailed classification. |
| 29 | +BFA\-YOLO surpasses YOLOv8 by 1.8% and 2.9% in mAP@0.5 on the multi\-view BFA\-3D |
| 30 | +and street\-view Facade\-WHU datasets, respectively. These results underscore |
| 31 | +BFA\-YOLO's superior performance in detecting facade attachments. |
| 32 | + |
| 33 | + |
| 34 | +**Code link**: No code link found in the abstract. |
| 35 | + |
| 36 | +**Paper link**: [Read more](http://arxiv.org/abs/2409.04025v1) |
| 37 | + |
| 38 | +--- |
| 39 | + |
| 40 | + |
4 | 41 | ## YOLO\-CL cluster detection in the Rubin/LSST DC2 simulation
|
5 | 42 |
|
6 | 43 | **Publication date**: 2024-09-05
|
@@ -293,38 +330,3 @@ tumor resection and improving surgical outcomes.
|
293 | 330 |
|
294 | 331 | ---
|
295 | 332 |
|
296 |
| - |
297 |
| -## Multi\-faceted Sensory Substitution for Curb Alerting: A Pilot Investigation in Persons with Blindness and Low Vision |
298 |
| - |
299 |
| -**Publication date**: 2024-08-26 |
300 |
| - |
301 |
| -**Author**: Ligao Ruan |
302 |
| - |
303 |
| -**Abstract**: Curbs \-\- the edge of a raised sidewalk at the point where it meets a street |
304 |
| -\-\- are crucial in urban environments, where they help delineate safe pedestrian |
305 |
| -zones from dangerous vehicular lanes. However, curbs themselves are |
306 |
| -significant navigation hazards, particularly for people who are blind or have |
307 |
| -low vision \(pBLV\). The challenges faced by pBLV in detecting and properly |
308 |
| -orientating themselves for these abrupt elevation changes can lead to falls and |
309 |
| -serious injuries. Despite recent advancements in assistive technologies, the |
310 |
| -detection and early warning of curbs remains a largely unsolved challenge. This |
311 |
| -paper aims to tackle this gap by introducing a novel, multi\-faceted sensory |
312 |
| -substitution approach hosted on a smart wearable; the platform leverages an RGB |
313 |
| -camera and an embedded system to capture and segment curbs in real time and |
314 |
| -provide early warning and orientation information. The system utilizes YOLO |
315 |
| -\(You Only Look Once\) v8 segmentation model, trained on our custom curb dataset |
316 |
| -for the camera input. The output of the system consists of adaptive auditory |
317 |
| -beeps, abstract sonification, and speech, conveying information about the |
318 |
| -relative distance and orientation of curbs. Through human\-subjects |
319 |
| -experimentation, we demonstrate the effectiveness of the system as compared to |
320 |
| -the white cane. Results show that our system can provide advanced warning |
321 |
| -through a larger safety window than the cane, while offering nearly identical |
322 |
| -curb orientation information. |
323 |
| - |
324 |
| - |
325 |
| -**Code link**: No code link found in the abstract. |
326 |
| - |
327 |
| -**Paper link**: [Read more](http://arxiv.org/abs/2408.14578v2) |
328 |
| - |
329 |
| ---- |
330 |
| - |
|