how to read info from binary file

Asked 14 years, 5 months ago

Viewed 650 times

I have a binary file and specifications:

after 'abst' (0x61627374):
var1 Unsigned 8-bit integer
var2 Unsigned 24-bit integer
var3 Sequence of Unicode 8-bit characters (UTF-8), terminated with 0x00

How can I read var1, var2 and var3 from the file?

python

asked Jul 20, 2011 at 11:55

bdfy

2 Answers

Quick and dirty and not tested:

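# Assumption: the file is small enough to fit into RAM,
# and 'abst' does not also occur inside the data itself.
with open('your_file', 'rb') as f:  # 'your_file' is a placeholder path; binary mode yields bytes
    data = f.read()
for hunk in data.split(b'abst')[1:]:  # skip the first hunk, since it is the stuff before the first 'abst' occurrence
    var1 = hunk[0]  # unsigned 8-bit integer (indexing bytes gives an int in Python 3)
    var2 = hunk[1] + hunk[2]*256 + hunk[3]*256*256  # unsigned 24-bit integer, least significant byte first
    var3 = hunk[4:].split(b'\x00')[0]  # raw UTF-8 bytes up to, but not including, the terminating 0x00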
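
For reference, the same hand-unpacking can be sketched as a small standard-library function. This is only an illustration under stated assumptions: the name `read_abst` and the `byteorder` parameter are hypothetical, and the byte order of the 24-bit field is not given in the quoted specification, so it is left as a parameter (the default of "big" is a guess). The loop above reads the least significant byte first, which corresponds to `byteorder="little"`.

```python
def read_abst(data, byteorder="big"):
    """Parse var1, var2, var3 that follow the first b'abst' marker in data.

    byteorder is an assumption: the quoted specification does not state
    the byte order of the 24-bit field.
    """
    marker = b"abst"
    start = data.find(marker)
    if start == -1:
        raise ValueError("marker b'abst' not found")
    pos = start + len(marker)
    if len(data) < pos + 4:
        raise ValueError("data ends before var1 and var2")
    var1 = data[pos]                                         # unsigned 8-bit integer
    var2 = int.from_bytes(data[pos + 1:pos + 4], byteorder)  # unsigned 24-bit integer
    end = data.find(b"\x00", pos + 4)                        # terminating 0x00 of var3
    if end == -1:
        raise ValueError("var3 is not terminated with 0x00")
    var3 = data[pos + 4:end].decode("utf-8")                 # terminator excluded
    return var1, var2, var3


# Made-up sample: junk, marker, var1=7, var2 bytes 01 02 03, a UTF-8 string, 0x00, trailing data
sample = b"junk" + b"abst" + bytes([7, 0x01, 0x02, 0x03]) + "héllo".encode("utf-8") + b"\x00tail"
print(read_abst(sample))
print(read_abst(sample, byteorder="little"))
```

Unlike the split-based loop, this only looks at the first marker and raises an error on truncated input instead of failing with an IndexError.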

edited Jul 20, 2011 at 12:50

answered Jul 20, 2011 at 12:01

Rudi

2 Comments

John Machin Over a year ago

Also, if there is guff before 'abst', you will unpack that.

2011-07-20T12:25:09.337Z

Rudi Over a year ago

@John Thank you. (And I even got caught by the first hunk error yesterday grml)

2011-07-20T12:53:09.277Z

The bitstring module might be helpful here as you have unusual bit lengths, and it can be a bit more readable than unpacking values 'by hand':

import bitstring
bitstring.bytealigned = True
s = bitstring.ConstBitStream(your_file)
if s.find('0x61627374'):  # seeks to your start code
    start_code, var1, var2 = s.readlist('bytes:4, uint:8, uint:24')
    p1 = s.pos
    if s.find('0x00', start=p1):  # find next '\x00'; a successful find moves pos to the match
        p2 = s.pos
        var3 = s[p1:p2+8].bytes  # interpret the slice (terminating 0x00 included) as bytes

answered Jul 21, 2011 at 15:36

Scott Griffiths
