JPL Technical Report Server

Adaptive fault tolerance for many-core based space-borne computing

Show simple item record James, Mark Springer, Paul Zima, Hans 2014-09-18T21:34:26Z 2014-09-18T21:34:26Z 2010-08-31
dc.identifier.citation 16th International Euro-Par Conference, Ischia, Italy, August 31 - September 3, 2010 en_US
dc.identifier.clearanceno 10-2164
dc.description.abstract This paper describes an approach to providing software fault tolerance for future deep-space robotic NASA missions, which will require a high degree of autonomy supported by an enhanced on-board computational capability. Such systems have become possible as a result of the emerging many-core technology, which is expected to offer 1024-core chips by 2015. We discuss the challenges and opportunities of this new technology, focusing on introspection-based adaptive fault tolerance that takes into account the specific requirements of applications, guided by a fault model. Introspection supports runtime monitoring of the program execution with the goal of identifying, locating, and analyzing errors. Fault tolerance assertions for the introspection system can be provided by the user, domain-specific knowledge, or via the results of static or dynamic program analysis. This work is part of an on-going project at the Jet Propulsion Laboratory in Pasadena, California. en_US
dc.description.sponsorship NASA/JPL en_US
dc.language.iso en_US en_US
dc.publisher Pasadena, CA : Jet Propulsion Laboratory, National Aeronautics and Space Administration, 2010 en_US
dc.subject many-core systems en_US
dc.subject dependability en_US
dc.subject fault tolerance en_US
dc.subject space missions en_US
dc.subject autonomy en_US
dc.title Adaptive fault tolerance for many-core based space-borne computing en_US
dc.type Preprint en_US

Files in this item

This item appears in the following Collection(s)

Show simple item record



My Account