All notable changes to this project will be documented in this file.
The format is based on Keep a Changelog, and this project adheres to Semantic Versioning.
- Multiple streams supported
- Support for counting total allocated memory via `Session::getInstance().totalAllocatedBytes()`
- Using alias instead of email in `setup.py`
- Compatibility between Python and C++ in how the data is stored in bt files
- The project now uses the GNU General Public License (GPL) v3; license file added
- Introduces new Python package for storing and loading numpy arrays; it can be installed with `pip install gputils-api`; unit tests and documentation
- When compiling with `cmake`, the unit tests will not be compiled by default unless the flag `GPUTILS_BUILD_TEST` is set
- Clang clippy recommendations applied
- Proper error handling when binary tensor file is not found
- Method `saveToFile` becomes `DTensor<T>::saveToFile(std::string pathToFile, Serialisation ser)`, i.e., the user can choose whether to save the file as a text (ASCII) file or a binary one
- Method `parseFromTextFile` renamed to `parseFromFile` (supports text and binary formats)
- Quick bug fix in `DTensor::parseFromTextFile` (passing storage mode to `vectorFromTextFile`)
- Set precision in `DTensor::saveToFile` properly
- `DTensor<T>::parseFromFile` throws `std::invalid_argument` if `T` is unsupported
- Created methods for serialising and deserialising `DTensor` objects
- (Breaking change) The methods `CholeskyFactoriser::factorise`, `CholeskyFactoriser::solve`, `QRFactoriser::factorise`, `QRFactoriser::leastSquares`, and `QRFactoriser::getQR` are now `void` and do not return a status code. Instead, a status code is returned by calling `statusCode`. This change leads to a reduction in data being downloaded from the GPU.
- In `Svd`, a status code (bool) is returned from `Svd<double>::factorise` only if `GPUTILS_DEBUG_MODE` is defined; otherwise, the method always returns `true`.
- New base class `IStatus` used for a universal implementation of `info()`
- When slicing a `DTensor` along `axis=2`, update the pointer to matrices
- Got rid of a warning in `DTensor<T>::createRandomTensor`
- Memory management improvements: we got rid of `pointerToMatrices`, which would unnecessarily allocate memory, and `addAB` does not allocate any new memory internally.
- Left/right Givens rotations: `GivensAnnihilator` implemented
- Patch initialisation of Q in QR decomposition.
- Add test for tall skinny matrices.
- Implementation and test of QR factorisation for tall or square matrices.
- Solve least-square problems with QR factorisation.
- Improve documentation.
- Implementation and test of methods `.maxAbs()` and `.minAbs()` for any tensor.
- Support for random tensors
- Implementation of `CholeskyMultiFactoriser`, which performs multiple Cholesky factorisations in parallel
- Using a function `numBlocks` instead of the macro `DIM2BLOCKS`
- Using `TEMPLATE_WITH_TYPE_T` and `TEMPLATE_CONSTRAINT_REQUIRES_FPX` for the code to run on both C++17 and C++20
- Implementation and test of `Nullspace(DTensor A)` and its method `.project(DTensor b)`; `project` will project in place each `bi` onto the nullspace of `Ai`
- Implementation of `DTensor<T>`, which is our basic entity for data storage and manipulation (supports basic linear algebra using `cublas` and `cusolver`); implementation of `+=`, `-=`, `*=` (for scalars and other tensors), `+`, `-`, `*` (scalars and tensors), printing (using `std::cout <<`), computation of norms (Frobenius and sum of absolute values of all elements); device vectors and matrices are tensors
- Singular value decomposition using `cublas`
- Least-squares on tensors
- Computation of nullspace matrices (on tensor objects)
- Cholesky factorisation
- Set up unit tests, CI, and CHANGELOG