tensor

In this module we build a small Tensor in C, along the lines of torch.Tensor or numpy.ndarray. The current code implements a simple 1-dimensional float tensor that we can access and slice. We get to see that the tensor object maintains both a Storage that holds the 1-dimensional data as it is in physical memory, and a View over that memory that has some start, end, and stride. This allows us to efficiently slice into a Tensor without creating any additional memory, because the Storage is re-used, while the View is updated to reflect the new start, end, and stride. We then get to see how we can wrap our C tensor into a Python module, just like PyTorch and numpy do.

The source code of the 1D Tensor is in tensor1d.h and tensor1d.c. The Python module that wraps this C code is in tensor1d.py. It uses the cffi library to interface with the C code. The tests use pytest and can be found in test_tensor1d.py.

Example usage:

import tensor1d

# 1D tensor of [0, 1, 2, ..., 19]
t = tensor1d.arange(20)

# getitem / setitem functionality
print(t[3]) # prints 3.0
t[-1] = 100 # sets the last element to 100.0

# slicing, prints [5, 7, 9, 11, 13]
print(t[5:15:2])

# slice of a slice works ok! prints [9, 11, 13]
# (note how the end range is oob and gets cropped)
print(t[5:15:2][2:7])

It is important to understand this topic because you can get fairly fancy with torch tensors and you have to be careful and aware of the memory underlying your code, when we're creating new storage or just a new view, functions that may or may not only accept "contiguous" tensors. Another pitfall is when you e.g. create a small slice of a big tensor, assuming that somehow the big tensor will be garbage collected, but in reality the big tensor will still be around because the small slice is just a view over the big tensor's storage. The same would be true of our own tensor here.

Actual production-grade tensors like torch.Tensor have a lot more functionality we won't cover. You can have different dtype not just float, different device, different layout, and tensors can be quantized, encrypted, etc etc., we will not cover these here.

TODOs:

bring our own implementation closer to torch.Tensor
implement a few simple ops like add, multiply, etc.
make tests better
implement 2D tensor, where we have to start worrying about 2D shapes/strides
implement broadcasting for 2D tensor

Good related resources:

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Makefile		Makefile
README.md		README.md
tensor1d.c		tensor1d.c
tensor1d.h		tensor1d.h
tensor1d.py		tensor1d.py
test_tensor1d.py		test_tensor1d.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tensor

License

About

Releases

Packages

Contributors 3

Languages

EurekaLabsAI/tensor

Folders and files

Latest commit

History

Repository files navigation

tensor

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages