Numpy: Reduce memory footprint of dot product with random data

I have a large numpy array that I am going to take a linear projection of using randomly generated values.

>>> input_array.shape
(50, 200000)
>>> random_array = np.random.normal(size=(200000, 300))
>>> output_array = np.dot(input_array, random_array)

Unfortunately, random_array takes up a lot of memory, and my machine starts swapping. It seems to me that I don't actually need all of random_array around at once; in theory, I ought to be able to generate it lazily during the dot product calculation...but I can't figure out how.

How can I reduce the memory footprint of the calculation of output_array from input_array?

Solution

This obviously isn't the fastest solution, but have you tried:

m, inner = input_array.shape
n = 300
out = np.empty((m, n))
for i in xrange(n):
    out[:, i] = np.dot(input_array, np.random.normal(size=inner))

Convert numbers in millions and thousands to string format
How can I make the image centered while padding in python
Unable to allocate array with shape and data type
numpy.unique with order preserved
Numerically obtaining response of a damped driven oscillator gives peak at wrong frequency
How to create a DataFrame of random integers with Pandas?
Pandas read_csv: low_memory and dtype options
Why does my implementation of trilateration give wrong results?
Pytorch tensor to numpy array
How to compute scipy sparse matrix determinant without turning it to dense?
Passing a NumPy 3d array to a C function with a triple pointer as an argument
How to generate a snapshot of a field in a time step with VTK and Python
Training a Keras model to identify leap years
Create arbitrary multidimensional zeros array
Representing tridiagonal matrix using numpy
In an array of counters that reset, find the start-end index for counter
NumPy array is not JSON serializable
Using numpy `as_strided` function to create patches, tiles, rolling or sliding windows of arbitrary dimension
How to fix/reset decreasing timestamps while preserving gaps in time-series data for CNN training?
Numpy/scipy - How to find the least squares solution with the constraint that Ax >= b?
Fast calculation of Pareto front in Python
How to convert an imagehash to a numpy array?
TypeError: Cannot convert numpy.ndarray to numpy.ndarray
extracting days from a numpy.timedelta64 value
How can i display the python scipt data within flask application
What does the mode 'reduced' in numpy.linalg.qr do?
Is there an easy way to implement width for `numpy.base_repr`?
Calculate a monthly minimum, mean and maximum on daily temperature data for February with Python
Convolving with a gaussian kernel vs Gaussian blur
Surprising lack of speedup in caching numpy calculations