Group: testing
Topic: implementing distributed systems and applications
Topic: reliability of distributed systems
Topic: software maintenance by patching
Topic: testing by voting or N-version
Topic: updating information in a distributed system
| |
Subtopic: non-recurring bugs
Quote: bugs are discovered when they are repetitious; problem of non-recurring bugs such as race conditions [»cbb_1980, OK]
| Subtopic: logs
Quote: Grapevine logs a week's activity at each server; useful for reconstructing failures [»schrMD2_1984]
| Quote: debug Grapevine by observing the logs of multiple servers [»schrMD2_1984]
| Subtopic: reproducing remote errors
Quote: run test cases from remote systems on the local site to help identify the error; e.g., a misconfiguration or old version [»nowiDA8_1978]
| Subtopic: reversible, distributed execution
Quote: added reversible execution across multiple processes to Simula; particularly useful for debugging [»lafoF3_1984]
| Subtopic: message handler
Quote: since the message handler sees all messages, messages are easily traced for simulation and checking [»katzL_1981]
| Subtopic: centralized server
Quote: run model calculations on a central machine to give its author control over program maintenance [»gammR_1982]
| Subtopic: telephone network
Quote: used two years of outage records to study failures of the US Public Switched Telephone Network [»kuhnDR4_1997]
| Quote: overloads of the telephone network caused almost half of the outage minutes; human error and acts of nature where other major causes [»kuhnDR4_1997]
| Quote: human intervention was able to correct many failures of the telephone network in under one hour [»kuhnDR4_1997]
| Subtopic: telephone software
Quote: half of the software in telephone switches concerns error detection and correction; this may explain the low outage rate due to software [»kuhnDR4_1997]
| Quote: software errors caused less downtime of the telephone network, 2%, than any other source of failure except vandalism
| Quote: restarting a telephone switch temporarily fixed a significant number of software-caused outages
| Subtopic: simulation
Quote: robot simulation particularly useful for testing synchronization and signaling among multiple devices [»craiJJ4_1987]
| Subtopic: automate remote maintenance
Quote: need a full system for remote software maintenance; exchanging tapes results in out-of-date versions at remote sites [»nowiDA8_1978]
| Subtopic: programmable controllers
Quote: with programmable controllers, can change reactors dynamically [»instrumentcontrol]
|
Related Topics
Group: testing (18 topics, 557 quotes)
Topic: implementing distributed systems and applications (41 items)
Topic: reliability of distributed systems (35 items)
Topic: software maintenance by patching (27 items)
Topic: testing by voting or N-version (10 items)
Topic: updating information in a distributed system (50 items)
|