1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
|
\input texinfo @c -*-texinfo-*-
@c This file uses the @command command introduced in Texinfo 4.0.
@c %**start of header
@setfilename bbe.info
@settitle bbe - binary block editor
@finalout
@c %**end of header
@set VERSION 0.2.2
@ifinfo
@dircategory Utilities
@direntry
* bbe: (bbe). Binary Block Editor.
@end direntry
@end ifinfo
@copying
This file documents version @value{VERSION} of @command{bbe}, a binary block editor.
Copyright @copyright{} 2005 Timo Savinen
@quotation
Permission is granted to make and distribute verbatim copies of
this manual provided the copyright notice and this permission notice
are preserved on all copies.
Permission is granted to copy and distribute modified versions of this
manual under the conditions for verbatim copying, provided that the entire
resulting derived work is distributed under the terms of a permission
notice identical to this one.
Permission is granted to copy and distribute translations of this manual
into another language, under the above conditions for modified versions.
@end quotation
@end copying
@titlepage
@title bbe
@subtitle binary block editor
@subtitle Version @value{VERSION}
@author by Timo Savinen
@page
@vskip 0pt plus 1filll
@insertcopying
@end titlepage
@c All the nodes can be updated using the EMACS command
@c texinfo-every-node-update, which is normally bound to C-c C-u C-e.
@ifnottex
@node Top, Overview, (dir), (dir)
@top bbe
@insertcopying
@end ifnottex
@c All the menus can be updated with the EMACS command
@c texinfo-all-menus-update, which is normally bound to C-c C-u C-a.
@menu
* Overview:: Preliminary information.
* Samples:: Samples using @command{bbe}.
* Invoking bbe:: How to run @command{bbe}.
* bbe programs:: How @command{bbe} works.
* Problems:: Reporting bugs.
@end menu
@node Overview, Samples, Top, Top
@chapter Preliminary information
@cindex greetings
@cindex overview
The @command{bbe} program is a sed-like editor for binary files. @command{bbe}
performs basic byte related transformations on blocks of input stream. @command{bbe} is non-interactive command line tool and
can be used as a part of a pipeline. @command{bbe} makes only one pass over input stream.
@command{bbe} contains also grep-like features, like printing the filename, offset and block number.
@node Samples, Invoking bbe, Overview, Top
@chapter Samples using @command{bbe}
@cindex sample
Few examples of running @command{bbe}:
@table @samp
@item bbe -b "/\x80\x50\x0e/:12" -e "d 0 3" -e "c BCD ASC" -e "A \x0a" -e "w /tmp/numbers" -o /dev/null /tmp/bdata
Task here is to extract BCD coded numbers from the file @file{/tmp/bdata} and write them
in ascii format with newline to file @file{/tmp/numbers}.
12 bytes long blocks containing the BCD-numbers start with three byte sequence of
values @samp{0x80}, @samp{0x50} and @samp{0x0e}. First three bytes (the block start sequence)
are removed (@samp{d 0 3}) rest of the block is transformed from BCD to Ascii (@samp{c BCD ASC}) and a newline character is
appended at the end of the block (@samp{A \x0a}).
All transformed blocks are written to @file{/tmp/numbers} (@samp{w /tmp/numbers}). Nothing is written to
the output (@samp{-o /dev/null}).
@*
@item bbe -b ":525" -e "i 524 \x0a" -o /tmp/data_with_nl /tmp/data
A newline is added after every 525'th byte of the file @file{/tmp/data}. Data with newlines is written to @file{/tmp/data_with_nl}.
@*
@item bbe -b ":526" -e "d 525 1" -o /tmp/data /tmp/data_with_nl
Every 526'th byte (inserted newline in previous example) is removed from the file @file{/tmp/data_with_nl}. Data without newlines is written to @file{/tmp/data}.
@*
@item bbe -e "s/\x0d\x0a/\x0a/"
Same effect as has command @command{dos2unix}.
@end table
@node Invoking bbe, bbe programs, Samples, Top
@chapter How to run @command{bbe}
@cindex running bbe
@cindex using
@command{bbe} accepts several commands to operate on blocks. Commands are
executed in the same order as they appear in command line or in a script file.
Order is significant, because the changes made to current byte by previous commands are
seen by next commands.
@menu
* Invocation:: Program invocation
* Block:: Block definition
* Commands:: bbe commands
* Limits:: Limitations
@end menu
@node Invocation, Block, , Invoking bbe
@section Program invocation
@cindex options
The format for running the @command{bbe} program is:
@example
bbe @var{option} @dots{}
@end example
@code{bbe} supports the following options:
@c Formatting copied from the Texinfo 4.0 manual.
@table @code
@item -b @var{BLOCK}
@itemx --block=@var{BLOCK}
Block definition.
@item -e @var{COMMAND}
@itemx --expression=@var{COMMAND}
Add command(s) to the commands to be executed. Commands must be separated by semicolon.
@item -f @var{script-file}
@itemx --file=@var{script-file}
Add commands from @var{script-file} to the commands to be executed.
@item -o @var{file}
@itemx --output=@var{file}
Write output to @var{file} instead of standard output.
@item -s
@itemx --suppress
Suppress printing of normal output, print only block contents.
@item -?
@itemx --help
Print an informative help message describing the options and then exit
successfully.
@item -V
@itemx --version
Print the version number of @command{bbe} and then exit successfully.
@end table
All remaining options are names of input files, if no input files are specified or @code{-} is given, then the standard input is read.
@node Block, Commands, Invocation, Invoking bbe
@section Block definition
@cindex block
@command{bbe} devides the input stream to blocks defined by the @code{-b} option. If block is
not defined, the whole input stream is considered as one block. Commands have effect only inside a block, rest of the input stream
remains untouched. Currently @command{bbe} supports only one block definition per invocation. If input stream consists of different blocks,
several @command{bbe}s can be combined in a pipeline.
A block can be defined several ways:
@table @code
@item @var{N}:@var{M}
Block starts at offset @var{N} of input stream (first byte is 0). Block is @var{M} bytes long. This definition allows
only one block to be defined.
@item :@var{M}
The whole input stream is divided to @var{M}-size blocks.
@item /@var{start}/:/@var{stop}/
Blocks start with sequence @var{start} and end with sequence @var{stop}. Both @var{start} and @var{stop} are included to blocks.
@item /@var{start}/:
Blocks start with sequence @var{start} and ends at next occurrence of @var{start}. Only the first @var{start} is included to block.
@item :/@var{stop}/
Blocks start at the beginning of input stream or after the end of previous block. Block ends at first occurrence of @var{stop}.
Only the last @var{stop} is included to blocks.
@end table
It is possible to use c-like byte values in @var{N}, @var{M}, @var{start} and @var{stop}.
Values in @var{start} and @var{stop} must be escaped with @code{\}, @code{\} can be escaped as @code{\\}.
Byte values can be expressed in decimal, octal or hexadecimal e.g. in @var{start} and @var{stop}:
@table @code
@item \123, \32 or \0
Decimal values
@item \012, \08 or \0278
Octal values
@item \x0a, \x67 or \xff
Hexadecimal values
@end table
Also escape code @code{\y} can be used. Decimal values of @code{\y}'s:
@table @code
@item \a
7
@item \b
8
@item \t
9
@item \n
10
@item \v
11
@item \f
12
@item \r
13
@item \;
59@*Semicolon must be escaped, because it is a command delimitter.
@end table
Values of @var{N} and @var{M} can be given in decimal, octal and hexadecimal:
@table @code
@item 123, 32 or 112232
Decimal values
@item 0128, 08123 or 0
Octal values
@item x456a, x167 or xffde
Hexadecimal values
@end table
@node Commands, Limits, Block, Invoking bbe
@section @command{bbe} commands
@cindex commands
Commands in @command{bbe} can be divided in two groups: Block related commands
and byte related commands. Block related commands operate at block level e.g. remove a block.
Byte related commands work allways inside a block and they don't have effect beyond the block boundaries.
Same escape codes for byte values in @var{string}s can be used as in @var{start} and @var{stop} of block definition.
@subheading Block commands are:
@table @code
@item I @var{string}
Write the @var{string} to output stream before the block.
@item D [@var{N}]
Delete the @var{N}'th block. If @var{N} is not defined all blocks are deleted from output stream.
@strong{Note}: First block is number one.
@item A @var{string}
Write the @var{string} to output stream after the block.
@item J @var{N}
Commands appearing after this command have no effect until @var{N} blocks are found. Means "Jump first @var{N} blocks".
@strong{Note}: Commands that are defined before this command have effect on every block.
@item L @var{N}
Commands appearing after this command have no effect after @var{N} blocks are found. Means "Leave blocks after @var{N}'th block".
@strong{Note}: Commands that are defined before this command have effect on every block.
@item N
Before block contents the file name where the current block starts is printed with colon.
@item F @var{f}
Before block contents the current stream offset and colon is printed in format specified by @var{f}.
Stream offset starts at zero. @var{f} can have one of following values:
@table @var
@item H
Hexadecimal.
@item D
Decimal.
@item O
Octal.
@end table
@item B @var{f}
Before block contents the current block number and colon is printed in format specified by @var{f}.
Block numbering starts at one. @var{f} can have one of the sames codes as @code{F}-command.
@item > @file{file}
Before printing a block, the contents of file @file{file} is printed.
@item < @file{file}
After printing a block, the contents of file @file{file} is printed.
@end table
@subheading Byte commands are:
@strong{Note}: The @var{n} in byte commands is offset from the beginning of current block, first byte is number zero.
@table @code
@item c @var{from} @var{to}
Converts bytes from @var{from} to @var{to}. Currently supported formats are:
@table @code
@item ASC
Ascii
@item BCD
Binary Coded Decimal
@end table
@strong{Note}: Bytes, that cannot be converted are passed through as they are. e.g. in ASC -> BCD conversion, ASCII characters not
in range @code{'0'} -- @code{'9'} are not converted.
@item d @var{n} @var{m}|*
Delete @var{m} bytes starting from the offset @var{n}. If * is defined instead of @var{m}, then
all bytes of the block starting from @var{n} are deleted.
@item i @var{n} @var{string}
Insert @var{string} after byte number @var{n}.
@item j @var{n}
Commands appearing after @code{j}-command have no effect concerning bytes 0-@var{n} of the block.
@item l @var{n}
Commands appearing after @code{l}-command have no effect concerning bytes starting from the byte number @var{n} of the block.
@item u @var{n} @var{c}
All bytes from start of the block to offset @var{n} are replaced by @var{c}.
@item f @var{n} @var{c}
All bytes starting from offset @var{n} to the end of the block are replaced by @var{c}.
@item p @var{format}
Contents of block is printed in formats specified by @var{format}. @var{format} can contain following format codes:
@table @var
@item H
Hexadecimal.
@item D
Decimal.
@item O
Octal.
@item A
Ascii, nonprintable characters are printed as space.
@item B
Binary.
@end table
@var{format} can contain several codes, values are then separated by hyphen.
@item r @var{n} @var{string}
Replace bytes with @var{string} starting at the byte number @var{n} of the block.
@item s/@var{search}/@var{replace}/
All occurences of @var{search} are replaced by @var{replace}. @var{replace} can be empty. Separator @code{/} can be replaced by any
character not present in @var{search} or @var{replace}.
@item w @file{file}
Contents of blocks are written to file @file{file}. @strong{Note}: Data inserted by commands @code{A}, @code{I},
@code{>} and @code{<}
are written to file @file{file} and @code{j} and @code{l} commands have no effect on @code{w}-commands. Zero size files are not preserved.@*
Filename can contain format string @code{%B} or @code{%nB}, these format strings are replace by current block number (starting from one), causing every block to have it's own file.
In @code{%nB}, the @code{n} is field width in range 0-99. If @code{n} has a leading zero, then the block numbers will be left padded with zeroes.
@item y/@var{source}/@var{dest}/
Translate bytes in @var{source} to the corresponding bytes in @var{dest}. @var{source} and @var{dest} must have equal length.
Separator @code{/} can be replaced by any character not present in @var{source} or @var{dest}.
@item & @var{c}
Performs binary and with @var{c} on block contents.
@item | @var{c}
Performs binary or with @var{c} on block contents.
@item ^ @var{c}
Performs exclusive or with @var{c} on block contents.
@item ~
Performs binary negation on block contents.
@item x
Exchange the contents of nibbles (half an octet) of bytes.
@end table
@node Limits, , Commands, Invoking bbe
@section Limitations
@cindex big files
@cindex limits
At least in GNU/Linux @command{bbe} should be able to handle big files (> 4 GB), other
systems are not tested.
There are however, some limitations in block and command definitions:
@table @emph
@item Strings in block definition
@itemx Search string in @code{s} command
are limited to @emph{16384} bytes.
@end table
@node bbe programs, Problems, Invoking bbe, Top
@chapter How @command{bbe} works
@command{bbe} scans the input stream just once, so the last block may differ from the block definition, because @command{bbe} doesn't
'peek' the end of the input stream. Last block may be shorter than defined, e.g. if block is defined as @samp{/string/:128}
and if the end of input stream is found before 128'th byte of the last block is reached, the last block remains shorter.
@subheading Basic execution cycle:
@enumerate
@item
Start of the block is searched. If found, data before block is written to output stream (unless @option{-s} is defined) and
step 2 is executed.
@item
Block commands affecting the start of the block (@code{I}, @code{D}, @code{J}, @code{N}, @code{F}, @code{>} and @code{B}) are executed.
@item
The block is scanned byte by byte and all byte commands (lower case letters) are executed.
@strong{Note}: Commands are executed on results of previous commands, if e.g. the first byte of the block is deleted,
the following commands don't 'see' the removed byte.
@item
When end of the block is reached the end of the block commands (@code{A} and @code{<}) are executed.
@item
Next block is searched, data between the blocks, if not suppressed with @option{-s}, is written to output stream.
@end enumerate
@subheading Few examples:
@table @samp
@item echo "The quick brown fox jumps over a lazy dog" | bbe -b "/The/:21" -e "j 4" -e "s/ /X/"
Output is
@example
The quickXbrownXfoxXjumps over a lazy dog
@end example
The only block in this is
@example
The quick brown fox j
@end example
All spaces in the block are converted to X's, before conversion first 4 bytes are skipped.@*
@item echo "The quick brown fox jumps over a lazy dog" | bbe -b ":/ /" -e "J 1" -e "A \x0a"
Output is:
@example
The quick
brown
fox
jumps
over
a
lazy
dog
@end example
All blocks end at space, a newline character is inserted after every block except the first block.@*
@item echo "The quick brown fox jumps over a lazy dog" | bbe -e "r 4 fast\x20" -e "s/f/c/"
Output is:
@example
The cast brown cox jumps over a lazy dog
@end example
Also the @code{f} in @code{fast} is converted to @code{c}.@*
@item echo "1234567890" | bbe -b ":1" -e "L 9" -e "A -"
Output is
@example
1-2-3-4-5-6-7-8-9-0
@end example
Hyphen is inserted after every 1 byte long block,but not after 9'th block.@*
@item bbe -s -b "/First line/:/Last line/" /tmp/text
Print lines between sentences @samp{First line} and @samp{Last line}.@*
@item bbe -s -b "%<a %:%</a>%" -e "s/\x0a/ /" -e "A \n" ./index.html
Extract all links from @file{./index.html}. To get one link per line,
all newlines are converted to spaces and newline is added after every link.@*
@item bbe -b "/\x5f\x28\x02/:10" -s -e "F d" -e "p h" -e "A \n" ./bindata
10 bytes long sequences starting with values @code{x5f} @code{x28} and @code{x02} are printed as hex values.
Also the file offset is printed before each sequence and new line is added after every sequence.
Example output:
@example
52688:x5f x28 x02 x32 x36 x5f x81 x64 x01 x93
68898:x5f x28 x02 x39 x46 x5f x81 x64 x41 x05
69194:x5f x28 x02 x42 x36 x5f x81 x64 x41 x05
@end example
@*
@item bbe -b "/Linux/:5" -s -e "N;D;A \x0a" /bin/* | uniq
Print the file names of those programs in /bin directory which contains word @samp{Linux}.
Example output:
@example
/bin/loadkeys:
/bin/mkbimage:
/bin/ps:
/bin/uname:
@end example
@*
@item bbe -b "/\x5f\x81\x18\x06/:10" -s -e "B d;d 0 4;c BCD ASC;A \n" ./bindata
Print BCD numbers and their block numbers in ascii format. Numbers start with sequence @code{x5f} @code{x81} @code{x18} @code{x06}.
The start sequence is not printed.@*
@item bbe -b "/\x5f/:2" -e "j 1;& \xf0" -o newdata bindata
The least significant nybble of bytes after @code{x5f} is cleared.@*
@item bbe -b "/\xff\xd8\xff/:/\xff\xd9/" -s -e "w pic%02B.jpg" -o /dev/null manual.pdf
Extract jpg-images from pdf-file to separate jpg-files (assuming that the jpg start/stop sequences
does not appear in other context than jpg-images).
Files will be named as @file{pic01.jpg}, @file{pic02.jpg}, @file{pic03.jpg},@dots{} @*
@item bbe -b "_<body>_:_</body>_" -s -o temp nicebody.html
@itemx bbe -b "_<body>_:_</body>_" -e "D;< temp" -o tmpindex.html index.html
@itemx mv tmpindex.html index.html
The body part of the html-document @file{index.html} is replaced by the body of the document @file{nicebody.html}.
@end table
@node Problems, , bbe programs, Top
@chapter Reporting Bugs
@cindex bugs
@cindex problems
If you find a bug in @command{bbe}, please send electronic mail to
@email{tjsa@@iki.fi}. Include the version number, which you can find by
running @w{@samp{bbe --version}}. Also include in your message the
output that the program produced and the output you expected.@refill
If you have other questions, comments or suggestions about
@command{bbe}, contact the author via electronic mail to
@email{tjsa@@iki.fi}. The author will try to help you out, although he
may not have time to fix your problems.
@contents
@bye
|