普通视图

Received before yesterday7 - PubMed

7 - PubMed
Cor<em>Deep</em> and the Sacrobosco Dataset: Detection of Visual Elements in Historical Documents 2022年10月26日 18:00

7 - PubMed

Cor<em>Deep</em> and the Sacrobosco Dataset: Detection of Visual Elements in Historical Documents

作者Jochen Büttner · Julius Martinetz · Hassan El-Hajj · Matteo Valleriani

2022年10月26日 18:00

J Imaging. 2022 Oct 15;8(10):285. doi: 10.3390/jimaging8100285.

ABSTRACT

Recent advances in object detection facilitated by deep learning have led to numerous solutions in a myriad of fields ranging from medical diagnosis to autonomous driving. However, historical research is yet to reap the benefits of such advances. This is generally due to the low number of large, coherent, and annotated datasets of historical documents, as well as the overwhelming focus on Optical Character Recognition to support the analysis of historical documents. In this paper, we highlight the importance of visual elements, in particular illustrations in historical documents, and offer a public multi-class historical visual element dataset based on the Sphaera corpus. Additionally, we train an image extraction model based on YOLO architecture and publish it through a publicly available web-service to detect and extract multi-class images from historical documents in an effort to bridge the gap between traditional and computational approaches in historical studies.

PMID:36286379 | PMC:PMC9605005 | DOI:10.3390/jimaging8100285

7 - PubMed
Building and Interpreting Deep Similarity Models 2020年9月2日 18:00

7 - PubMed

Building and Interpreting Deep Similarity Models

作者Oliver Eberle · Jochen Buttner · Florian Krautli · Klaus-Robert Muller · Matteo Valleriani · Gregoire Montavon

2020年9月2日 18:00

IEEE Trans Pattern Anal Mach Intell. 2022 Mar;44(3):1149-1161. doi: 10.1109/TPAMI.2020.3020738. Epub 2022 Feb 3.

ABSTRACT

Many learning algorithms such as kernel machines, nearest neighbors, clustering, or anomaly detection, are based on distances or similarities. Before similarities are used for training an actual machine learning model, we would like to verify that they are bound to meaningful patterns in the data. In this paper, we propose to make similarities interpretable by augmenting them with an explanation. We develop BiLRP, a scalable and theoretically founded method to systematically decompose the output of an already trained deep similarity model on pairs of input features. Our method can be expressed as a composition of LRP explanations, which were shown in previous works to scale to highly nonlinear models. Through an extensive set of experiments, we demonstrate that BiLRP robustly explains complex similarity models, e.g., built on VGG-16 deep neural network features. Additionally, we apply our method to an open problem in digital humanities: detailed assessment of similarity between historical documents, such as astronomical tables. Here again, BiLRP provides insight and brings verifiability into a highly engineered and problem-specific similarity model.

PMID:32870784 | DOI:10.1109/TPAMI.2020.3020738