skip to main content
Primo Search
Search in: Busca Geral

Metadata and data files supporting the related article: Application of convolutional neural networks to breast biopsies to delineate tissue correlates of mammographic breast density

Mullooly, Maeve ; Bejnordi, Babak Ehteshami ; Pfeiffer, Ruth M. ; Shaoqi Fan ; Palakal, Maya ; Hada, Manila ; Vacek, Pamela M. ; Weaver, Donald L. ; Shepherd, John A. ; Fan, Bo ; Mahmoudzadeh, Amir Pasha ; Wang, Jeff ; Malkov, Serghei ; Johnson, Jason M. ; Herschorn, Sally D. ; Sprague, Brian L. ; Hewitt, Stephen ; Brinton, Louise A. ; Karssemeijer, Nico ; Laak, Jeroen Van Der ; Beck, Andrew ; Sherman, Mark E. ; Gierach, Gretchen L.

figshare 2019

Texto completo disponível

Citações Citado por
  • Título:
    Metadata and data files supporting the related article: Application of convolutional neural networks to breast biopsies to delineate tissue correlates of mammographic breast density
  • Autor: Mullooly, Maeve ; Bejnordi, Babak Ehteshami ; Pfeiffer, Ruth M. ; Shaoqi Fan ; Palakal, Maya ; Hada, Manila ; Vacek, Pamela M. ; Weaver, Donald L. ; Shepherd, John A. ; Fan, Bo ; Mahmoudzadeh, Amir Pasha ; Wang, Jeff ; Malkov, Serghei ; Johnson, Jason M. ; Herschorn, Sally D. ; Sprague, Brian L. ; Hewitt, Stephen ; Brinton, Louise A. ; Karssemeijer, Nico ; Laak, Jeroen Van Der ; Beck, Andrew ; Sherman, Mark E. ; Gierach, Gretchen L.
  • Assuntos: Artificial Intelligence and Image Processing ; Cancer ; Cancer Diagnosis ; Computational Biology ; FOS: Clinical medicine ; FOS: Computer and information sciences ; Medicine ; Oncology and Carcinogenesis not elsewhere classified ; Pattern Recognition and Data Mining
  • Notas: 10.1038/s41523-019-0134-6
    RelationTypeNote: IsSupplementTo -- 10.1038/s41523-019-0134-6
  • Descrição: Breast density is a radiologic feature that reflects fibroglandular tissue content relative to breast area or volume, and it is a breast cancer risk factor. This study employed deep learning approaches to identify histologic correlates in radiologically-guided biopsies that may underlie breast density and distinguish cancer among women with elevated and low density. Data access: Datasets supporting figure 2, tables 2 and 3 and supplementary table 2 of the published article are publicly available in the figshare repository, as part of this data record (https://doi.org/10.6084/m9.figshare.9786152). These datasets are contained in the zip file NPJ FigShare.zip. Datasets supporting figure 3, table 1 and supplementary table 1 of the published article are not publicly available to protect patient privacy, but can be made available on request from Dr. Gretchen L. Gierach, Senior Investigator, Division of Cancer Epidemiology and Genetics, National Cancer Institute, Bethesda, MD, USA, email address: gierachg@mail.nih.gov. Study description and aims: The study aimed to identify tissue correlates of breast density that may be important for distinguishing malignant from benign biopsy diagnoses separately among women with high and low breast density, to help inform cancer risk stratification among women undergoing a biopsy following an abnormal mammogram. Haematoxylin and eosin (H&E)-stained digitized images from image-guided breast biopsies (n=852 patients) were evaluated. Breast density was assessed as global and localized fibroglandular volume (%). A convolutional neural network characterized H&E composition. 37 features were extracted from the network output, describing tissue quantities and morphological structure. A random forest regression model was trained to identify correlates most predictive of fibroglandular volume (n=588). Correlations between predicted and radiologically quantified fibroglandular volume were assessed in 264 independent patients. A second random forest classifier was trained to predict diagnosis (invasive vs. benign); performance was assessed using area under receiver-operating characteristics curves (AUC). For more details on the methodology please see the published article. Study approval: The Institutional Review Boards at the NCI and the University of Vermont approved the protocol for this project for either active consenting or a waiver of consent to enrol participants, link data and perform analytical studies. Dataset descriptions: Data supporting figure 2: Datasets Figure 2A H&E.jpg, Figure 2A Mammogram.jpg, Figure 2B H&E.jpg and Figure 2B Mammogram.jpg are in .jpg file format and consist of histological whole slide H&E images and corresponding full-field digital mammograms from patients whose biopsies yielded diagnoses of atypical ductal hyperplasia and invasive carcinoma. Data supporting figure 3: Dataset Figure 3.xls is in .xls file format and contains raw data used to generate the Receiver Operating Characteristic (ROC) curves for the prediction of invasive cancer among women with high percent global fibroglandular volume, low percent global fibroglandular volume, high percent localized fibroglandular volume and low percent localized fibroglandular volume. Data supporting table 1: Dataset Table1_analysis.sas7bdat is in SAS file format and contains the characteristics of study participants in the BREAST Stamp Project, who were referred for an image-guided breast biopsy, stratified by the training and testing sets (n = 852). Data supporting table 2: Datasets Global FGV.xls (accompanying Global FGV.png file) and Localized FGV.xls (accompanying Localized FGV.png file) are in .xls file format and the accompanying files are in .png file format. The data contain histologic features identified in the random forest model for the prediction of global and localized % fibroglandular volume. Data supporting table 3: Datasets HighGlobal_feature_importance.xls, HighGlobal_feature_importance.pdf, HighLocal_feature_importance.xls, HighLocal_feature_importance.pdf, LowGlobal_feature_importance.xls, LowGlobal_feature_importance.pdf, LowLocal_feature_importance.xls, LowLocal_feature_importance.pdf are in .xls file format. The accompanying figures generated from the data in the .xls files are in .pdf file format. These files contain histologic features identified in the random forest model for the prediction of invasive cancer status among women with high vs. low % fibroglandular volume. Data supporting supplementary table 1: Datasets testfeatures.xls and trainfeatures.xls are in .xls file format and include the distribution and description of the 37 histologic features extracted from the convolutional neural network deep learning output in the H&E stained whole slide images from the training and testing sets. Data supporting supplementary table 2: Datasets All_samples_global.xls, All_samples_global.png, All_samples_local.xls, All_samples_local.png, PostMeno_global.xls, PostMeno_global.png, PostMeno_local.xls, PostMeno_local.png, PreMeno_global.xls, PreMeno_global.png, PreMeno_local.xls, PreMeno_local.png are in .xls file format. The accompanying figures generated from the data in the .xls files are in .png file format. These data include the histologic features identified in the random forest model that included BMI for the prediction of global and localized % fibroglandular volume.Software needed to access the data: Data files in SAS file format require the SAS software to be accessed.
  • Editor: figshare
  • Data de criação/publicação: 2019
  • Idioma: Inglês

Buscando em bases de dados remotas. Favor aguardar.