Floating-point quantization from double to 8-bit

How can I round a double-precision floating-point number to the nearest value that can be stored in an 8-bit floating-point format? I'm trying to work it out mathematically, but I don't know how.

I have a double x, and I need to find the nearest y that can be written as n*2^b, where n and b are integers and n is in [-128, 127]. How can I find the best n and b?

Solution

I solved it with this algorithm:

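function y = DoubleTo8bit( x )
% Round x to the nearest value of the form n*2^b, n an integer in [-128,127].
s = sign(x);
x = abs(x);

if x == 0
    y = 0;
    return;
end
% Choose b so that x/2^b lies in [64,128), i.e. 7 bits of magnitude.
% The same b is used for both signs: the extra code n = -128 adds no
% precision, because -128*2^b equals -64*2^(b+1).
b = floor(log2(x)) - 6;
m = s*round(x/2^b);   % |m| can round up to 128, which equals 64*2^(b+1)

y = m*2^b;
end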
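
One way to sanity-check a closed-form rule like this is to compare it against a brute-force search over every representable value. The script below is only a sketch under stated assumptions: the exponent range -20:20, the sample inputs, and the variable names are arbitrary choices of mine, and the closed form it tests uses b = floor(log2(|x|)) - 6 for both signs. That choice rests on the observation that the set of values n*2^b with n in [-128,127] is symmetric around zero, since -128*2^b is the same number as -64*2^(b+1).

```matlab
% Enumerate every n*2^b with n in [-128,127] over a modest exponent range.
% The range -20:20 is an arbitrary choice that covers the samples below.
[n, b] = meshgrid(-128:127, -20:20);
candidates = n(:) .* 2.^b(:);

samples  = [0.1, 3.14159, -129.3, 1000];   % arbitrary test inputs
err_ref  = zeros(size(samples));           % distance to nearest candidate
err_fast = zeros(size(samples));           % distance achieved by closed form

for k = 1:numel(samples)
    x = samples(k);
    err_ref(k) = min(abs(candidates - x));

    % Closed form (assumption of this sketch): scale so that
    % |x|/2^bq lies in [64,128), round, and scale back.
    bq = floor(log2(abs(x))) - 6;
    y  = sign(x) * round(abs(x) / 2^bq) * 2^bq;
    err_fast(k) = abs(y - x);

    fprintf('x = %g -> y = %g\n', x, y);
end

% Both errors should match if the closed form finds a nearest value.
disp(isequal(err_fast, err_ref));
```

For instance, -129.3 should come out as -130 (n = -65, b = 1), because -129 itself would need n = -129, which is outside [-128,127]. Comparing distances rather than the values themselves keeps the check valid when x lies exactly halfway between two candidates.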
