How do I quickly multiply two matrices stored in dynamic C arrays?

Closed as unclear by Vlad from Moscow, αλεχολυτ, Kromster, HamSter, fori1ton on 15 Nov '16 at 6:31.


  • The question as asked is incomplete. Multiplication speed depends on a number of factors: the size of the matrices, the nature of their elements (including their size), the access pattern, the storage layout, the ring in which the calculations are performed; these are just some of the questions that need answering first. - Zealint
  • @Zealint I just need an algorithm, at least for square matrices; everything else is standard. - nikita

3 answers

The standard formula uses the index `k` asymmetrically:

    a[i][k] * b[k][j]

As `k` runs, `a[i][k]` walks along a row, while `b[k][j]` jumps down a column with a large stride.

You can get a performance improvement by transposing the second matrix:

    a[i][k] * b[j][k]

Now both operands are read sequentially along rows: you can hoist the row pointers out of the inner loop to avoid repeated dereferencing, and the data streams through the processor cache properly.


If the matrices are large enough, it makes sense to consider asymptotically faster algorithms, for example Strassen's.

  • @nikita, transposition is quadratic, while the multiplication is cubic. I mean an in-place transposition; if an extra allocation is needed, the benefit is less clear. In general, it is better to measure. But if there is an option to build the second matrix transposed from the start and nothing else suffers, that should be used. (And above I mistakenly called transposition "inversion"...) - Qwertiy
  • Which pointer can be saved? - nikita
  • @nikita, a[i] and b[j] . Though I think the compiler will figure that out itself. - Qwertiy
  • An experiment showed that at realistic sizes (100x100) transposing gives practically nothing... A gain of about 5% does not pay for the long, laborious transposition of the matrix. - Harry
  • @Harry, 5% at n = 100, 15% at n = 200, 30% at n = 500. And hoisting the pointer by hand gives nothing by itself; the compiler is smart enough on its own: ideone.com/QoW7x9 If you measured incorrectly, correct it. - Qwertiy

Most likely, what you want is ordinary matrix multiplication.


The classic formula is

    c[i][j] = Σ_k a[i][k] * b[k][j]

"Accelerated" algorithms of the Strassen type make no sense for ordinary, everyday tasks...

  • It makes sense to transpose one of the matrices, as far as I know. - Qwertiy
  • For better cache locality? In principle, yes, but at realistic sizes the matrices usually fit in the cache anyway. Still, it would be worth experimenting sometime... - Harry
  • Not only that. We also save a pointer dereference. I did write that in my answer. - Qwertiy
  • It's dynamically allocated memory: there is no guarantee at all that adjacent rows lie next to each other. - Qwertiy
  • Well, if someone is foolish enough not to allocate the memory as a single block, who can help him?.. - Harry

"The fastest": take a specialized library such as MKL, ATLAS, or ViennaCL. First, they implement BLAS, which is the standard toolkit for working with matrices; second, they are genuinely well optimized (ViennaCL even runs on the GPU), which lets them run much faster than a naive implementation.