Single precision floating point

Click For Summary

Discussion Overview

The discussion revolves around converting the fraction 80/9 into single-precision floating-point format. Participants explore the steps involved in the conversion process, including binary representation, mantissa calculation, and rounding methods.

Discussion Character

  • Homework-related
  • Technical explanation
  • Debate/contested

Main Points Raised

  • Some participants express confusion about the conversion process and the derivation of certain binary representations, specifically the transition from decimal to binary and the recurring nature of the binary fraction.
  • One participant suggests that the binary representation of 10/9 results in a recurring sequence, leading to the notation 1.000111 000111 000111..., while another participant initially states it as 1.00011 00011 00011.
  • There is a discussion about the geometric series used to explain the binary fraction, with some participants calculating the value of 000111 as 7/64 and others questioning the accuracy of the initial binary expansion.
  • Participants discuss the construction of the floating-point representation, including the sign bit, exponent, and mantissa, with one participant detailing the steps to arrive at the final binary format.
  • There is a mention of a discrepancy in a lecturer's notes regarding the final bits of the floating-point representation, with participants debating whether it was a mistake or a result of rounding.
  • Some participants reference the IEEE standard for floating-point representation and discuss rounding methods, including truncation and rounding up based on the next bit's value.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the exact binary representation of 10/9, with differing views on the recurring sequence and the accuracy of the lecturer's notes. The discussion remains unresolved regarding the final representation and the rounding process.

Contextual Notes

Participants note potential errors in the lecturer's notes and express uncertainty about the correct binary representation and rounding methods. The discussion highlights the complexity of floating-point conversion and the importance of precision in mathematical representation.

Maybe_Memorie
Messages
346
Reaction score
0

Homework Statement



Convert 80/9 to single-precision floating-point.

Homework Equations





The Attempt at a Solution



Right, I don't understand this at all but I'm trying to follow my notes, not very successfully..

80/9 is positive so the sign bit is 0.

Exponent is adjusted so the mantissa is between 1 and 2, so we get 8x(10/9)
The true exponent is 3.

Use binary long division to calculate 10/9 as a recurring binary point number.
9 = (1001)
10 = (1010)

It results with 1.00011 00011 00011 ...

Next he does a geometric series and says "The group 000111 represents 7/64".


I'm lost. Can someone please explain where he's getting 000111 and 7/64 please?
Thank you! :D
 
Physics news on Phys.org
Maybe_Memorie said:

Homework Statement



Convert 80/9 to single-precision floating-point.

Homework Equations





The Attempt at a Solution



Right, I don't understand this at all but I'm trying to follow my notes, not very successfully..

80/9 is positive so the sign bit is 0.

Exponent is adjusted so the mantissa is between 1 and 2, so we get 8x(10/9)
The true exponent is 3.

Use binary long division to calculate 10/9 as a recurring binary point number.
9 = (1001)
10 = (1010)

It results with 1.00011 00011 00011 ...

Next he does a geometric series and says "The group 000111 represents 7/64".


I'm lost. Can someone please explain where he's getting 000111 and 7/64 please?
Thank you! :D

Think of 000111 being to the right of a "binary" point (instead of decimal point). The leftmost zero is in the 1/2's place (2-1). The next zero is in the 1/4 th's place (2-2), and so on.

So 000111 represents 0 * 1/2 + 0 * 1/4 + 0 * 1/8 + 1 * 1/16 + 1 * 1/32 + 1 * 1/64

If you add the three fractions on the right, what do you get?
 
You get 7/64 :smile:

But I don't see where 000111 came from, as in his solution it's 00011 that's recurring. :confused:
 
Maybe_Memorie said:
It results with 1.00011 00011 00011 ...

Next he does a geometric series and says "The group 000111 represents 7/64".

I'm lost. Can someone please explain where he's getting 000111 and 7/64 please?
The binary expansion you wrote is incorrect. It should be 1.000111 000111 000111… That's where he got 000111 from, if that's what you were wondering.
 
Wait, I see it now! :smile:

So he uses a geometric series and gets 10/9, which shows the result is correct.

He continues the fractional part until it exceeds 23 bits
.00011100011100011100011

He then gets the required number in binary and I've no idea how :confused:
 
Now, as to where the 000111 comes from,

It results with 1.00011 00011 00011

This part is, I believe, incorrect. I did the division myself, and got 1.000111 000111 000111... To be honest I quit after 1.000111, but I'm reasonably sure that the part that repeats is 000111, which is what you were asking about, and is different from the results I quoted.
 
80/9 is equal to 8.888…, which in binary is 1000.111000111000…2. Converting to binary is the first thing you have to know how to do. There are techniques you can use to crank this out relatively quickly.

Now you want to write this in base-2 scientific notation: 1.000111000111…x23.

From here, it's easy to construct the floating-point representation. The number is positive, so the first bit is 0. The exponent is 3. You add 127 to it, which gives 130=100000102, which are the next eight bits. Finally, the first digit of the mantissa will always be a 1, so you can drop it and just copy the first 23 bits following the binary point. Putting it all together, you get

80/9 = 0 10000010 00011100011100011100011
 
Okay I understand so far. :smile:

However my lecturer gets an answer of
0 1000 0010 000111 000111 000111 00100
= 0100 0001 0000 1110 0011 1000 1110 0100 =
= 4 1 0 e 3 8 e 4

I understand how he goes from the first line to the second to the third, but not where the first line comes from.
 
How to get the first line is what I explained in my previous post!
 
  • #10
That's what I assumed but it threw me off seeing 00100 at the end of his first line because this doesn't appear anywhere else
 
  • #11
Ah, I didn't notice that. [STRIKE]That's just a mistake on his part. The last four bits should be 0011, and the last hexadecimal digit should be 3.[/STRIKE]

I just realized it's due to rounding. The next bit would have been a 1, so you round up and 3 becomes a 4.
 
  • #12
vela said:
[STRIKE]That's just a mistake on his part. The last four bits should be 0011, and the last hexadecimal digit should be 3.[/STRIKE]

I wouldn't rule that out. His notes, especially his Group theory ones, are full of mistakes and paradoxes. :rolleyes:


Thanks very much for the help! :)
 
  • #13
vela said:
I just realized it's due to rounding. The next bit would have been a 1, so you round up and 3 becomes a 4.
That's my thought, too. Do you know if there's a rule so that when the next bit is 0, you truncate, and if it's a 1, you round up?
 
  • #14
I think so. It's mentioned in the Wikipedia article on floating-point representation.
 
  • #15
The IEEE standard specifies 5 rounding algorithms:
http://en.wikipedia.org/wiki/IEEE_754-2008#Rounding_algorithms

However, in practice you'd always want to minimize the error in floating point arithmetic, meaning you'd round to the nearest representation.
In this case that means rounding up (without a tie).
 

Similar threads

Replies
10
Views
4K
  • · Replies 4 ·
Replies
4
Views
2K
Replies
4
Views
2K
  • · Replies 5 ·
Replies
5
Views
6K
Replies
6
Views
10K
  • · Replies 23 ·
Replies
23
Views
5K
  • · Replies 4 ·
Replies
4
Views
5K
  • · Replies 23 ·
Replies
23
Views
4K
  • · Replies 4 ·
Replies
4
Views
8K
  • · Replies 8 ·
Replies
8
Views
6K