Yissum - Research Development Company of the Hebrew University

High Performance Fault Tolerant Matrix Multiplication Algorithms

Posted by Yissum - Research Development Company of the Hebrew UniversityResponsive · Innovative Products and Technologies · Israel

Summary of the technology

High Performance Fault Tolerant Matrix Multiplication Algorithms
Project ID : 10-2017-4458

Description of the technology

Reduces incidence of serious errors and costs


Computer Science & Engineering

Development Stage


Patent Status

Patent pending


  • Errors are a serious concern in high performance computing. Increase in machine size and decrease in operating voltage lead to increase in the incidence of hard errors (component failure) and soft errors (bit flip).
  • There are various general-purpose hard error resiliency solutions, but these tend to be costly and severely degrade performance.
  • More efficient solutions incurring much lower overhead are available for numerical linear algebra computations based on distributed ”2D” algorithms, but these can guarantee high performance only when matrices feel all local memories otherwise there is a degradation in performance.

Our Innovation

Fault tolerance matrix multiplication algorithms that reduce overheads on resources. The algorithms reduce both the number of additional processors required and communication costs, while resulting in only a negligible penalty on the flops count.

See attached tables.

Key Features

  • In the case of 2D algorithms, the number of additional processors required is reduced and a significant factor of the latency cost shaved off.
  • In cases of local memory larger than the minimum needed to store inputs and outputs, fault tolerant adaptations of blocked 2.5D algorithms and BFS-DFS (breadth first search and depth first search) algorithms are obtained that attain low order communication costs, require very few or no additional processors and have negligible overhead on the flops count.
  • Strassen and Strassen-like algorithms attain resiliency with small overhead costs.
  • Lower bounds on resources and performance parameters such as, additional processors, flops, and communication costs are formulated as functions of machine parameters, input size, maximum number of simultaneous faults and total number of faults showing that the algorithms are optimal or close to optimal.

Development Milestones

  • Submission of paper(s)
  • Testing, implementation, benchmarking

The Opportunity

  • The algorithms will be used in large scale computing, such as clouds, supercomputers, medium-large clusters, where occasional hard-errors are inevitable and are likely to harm the computation.
  • Interested parties may include Intel, AMD, NVIDIA, and any other hardware provider that supports numerical computations as well as software providers for large scale or error-sensitive computing.


Project manager

Tamir Huberman
VP Business Dev. Computer Science & IT Director

Project researchers

Oded Schwartz
HUJI, School of Computer Science and Engineering

Related keywords

  • Information Processing, Information System, Workflow Management
  • IT and Telematics Applications
  • Multimedia
  • Computers
  • Computer Graphics Related
  • Specialised Turnkey Systems
  • Scanning Related
  • Peripherals
  • Computer Services
  • Computer Software Market
  • Other Computer Related
  • algorithms
  • Computer Science & Engineering

About Yissum - Research Development Company of the Hebrew University

Technology Transfer Office from Israel

Yissum Research Development Company of the Hebrew University of Jerusalem Ltd. Founded in 1964 to protect and commercialize the Hebrew University’s intellectual property. Ranked among the top technology transfer companies, Yissum has registered over 8,900 patents covering 2,500 inventions; has licensed out 800 technologies and has spun-off 90 companies. Products that are based on Hebrew University technologies and were commercialized by Yissum generate today over $2 Billion in annual sales.

Send your request

By clicking "Send your request" you are signing up and accepting our Terms of Service and Privacy policy

Technology Offers on Innoget are directly posted and managed by its members as well as evaluation of requests for information. Innoget is the trusted open innovation and science network aimed at directly connect industry needs with professionals online.