skip to main content
Visitante
Meu Espaço
Minha Conta
Sair
Identificação
This feature requires javascript
Tags
Revistas Eletrônicas (eJournals)
Livros Eletrônicos (eBooks)
Bases de Dados
Bibliotecas USP
Ajuda
Ajuda
Idioma:
Inglês
Espanhol
Português
This feature required javascript
This feature requires javascript
Primo Search
Busca Geral
Busca Geral
Acervo Físico
Acervo Físico
Produção Intelectual da USP
Produção USP
Search For:
Clear Search Box
Search in:
Busca Geral
Or hit Enter to replace search target
Or select another collection:
Search in:
Busca Geral
Busca Avançada
Busca por Índices
This feature requires javascript
This feature requires javascript
Fault-Tolerance Techniques for High-Performance Computing
Herault, Thomas ; Robert, Yves Robert, Yves ; Herault, Thomas ; Herault, Thomas ; Robert, Yves
Cham: Springer Nature 2015
Texto completo disponível
Citações
Citado por
Exibir Online
Detalhes
Resenhas & Tags
Mais Opções
Nº de Citações
This feature requires javascript
Enviar para
Adicionar ao Meu Espaço
Remover do Meu Espaço
E-mail (máximo 30 registros por vez)
Imprimir
Link permanente
Referência
EasyBib
EndNote
RefWorks
del.icio.us
Exportar RIS
Exportar BibTeX
This feature requires javascript
Título:
Fault-Tolerance Techniques for High-Performance Computing
Autor:
Herault, Thomas
;
Robert, Yves
Robert, Yves
;
Herault, Thomas
;
Herault, Thomas
;
Robert, Yves
Assuntos:
Computer programming, programs, data
;
Computer Science
;
Distributed, Parallel, and Cluster Computing
;
Electronic data processing
;
Numeric Computing
;
Performance and Reliability
;
System Performance and Evaluation
Descrição:
This timely text/reference presents a comprehensive overview of fault tolerance techniques for high-performance computing (HPC).The text opens with a detailed introduction to the concepts of checkpoint protocols and scheduling algorithms, prediction, replication, silent error detection and correction, together with some application-specific techniques such as algorithm-based fault tolerance. Emphasis is placed on analytical performance models. This is then followed by a review of general-purpose techniques, including several checkpoint and rollback recovery protocols. Relevant execution scenarios are also evaluated and compared through quantitative models.Topics and features: includes self-contained contributions from an international selection of preeminent experts; provides a survey of resilience methods and performance models; examines the various sources for errors and faults in large-scale systems, detailing their characteristics, with a focus on modeling, detection and prediction; reviews the spectrum of techniques that can be applied to design a fault-tolerant message passing interface; investigates different approaches to replication, comparing these to the traditional checkpoint-recovery approach; discusses the challenge of energy consumption of fault-tolerance methods in extreme-scale systems, proposing a methodology to estimate such energy consumption.This authoritative volume is essential reading for all researchers and graduate students involved in high-performance computing.
Títulos relacionados:
Computer Communications and Networks
Editor:
Cham: Springer Nature
Data de criação/publicação:
2015
Formato:
325
Idioma:
Inglês
Links
View record in HAL
This feature requires javascript
This feature requires javascript
Voltar para lista de resultados
This feature requires javascript
This feature requires javascript
Buscando em bases de dados remotas. Favor aguardar.
Buscando por
em
scope:(USP_VIDEOS),scope:("PRIMO"),scope:(USP_FISICO),scope:(USP_EREVISTAS),scope:(USP),scope:(USP_EBOOKS),scope:(USP_PRODUCAO),primo_central_multiple_fe
Mostrar o que foi encontrado até o momento
This feature requires javascript
This feature requires javascript