  1. Jul 13, 2018
  2. Apr 27, 2018
    • Improve performance of Buf::get_*() (#195) · 51e435b7
      kohensu authored
      The new implementation tries to get the data directly from bytes() (which
      is possible most of the time); if bytes() does not hold enough data, it
      falls back to the previous code: copy the needed bytes into a temporary
      buffer before returning the data.

      Here are the bench results:
                                     Before                After           x-faster
      get_f32::cursor             64 ns/iter (+/- 0)    20 ns/iter (+/- 0)    3.2
      get_f32::tbuf_1             77 ns/iter (+/- 1)    34 ns/iter (+/- 0)    2.3
      get_f32::tbuf_1_costly      87 ns/iter (+/- 0)    62 ns/iter (+/- 0)    1.4
      get_f32::tbuf_2            151 ns/iter (+/- 18)  160 ns/iter (+/- 1)    0.9
      get_f32::tbuf_2_costly     180 ns/iter (+/- 2)   187 ns/iter (+/- 2)    1.0
      
      get_f64::cursor             67 ns/iter (+/- 0)    21 ns/iter (+/- 0)    3.2
      get_f64::tbuf_1             80 ns/iter (+/- 0)    35 ns/iter (+/- 0)    2.3
      get_f64::tbuf_1_costly      82 ns/iter (+/- 3)    60 ns/iter (+/- 0)    1.4
      get_f64::tbuf_2            154 ns/iter (+/- 1)   164 ns/iter (+/- 0)    0.9
      get_f64::tbuf_2_costly     170 ns/iter (+/- 2)   187 ns/iter (+/- 1)    0.9
      
      get_u16::cursor             66 ns/iter (+/- 0)    20 ns/iter (+/- 0)    3.3
      get_u16::tbuf_1             77 ns/iter (+/- 0)    35 ns/iter (+/- 0)    2.2
      get_u16::tbuf_1_costly      85 ns/iter (+/- 2)    62 ns/iter (+/- 0)    1.4
      get_u16::tbuf_2            147 ns/iter (+/- 0)   154 ns/iter (+/- 0)    1.0
      get_u16::tbuf_2_costly     160 ns/iter (+/- 1)   177 ns/iter (+/- 0)    0.9
      
      get_u32::cursor             64 ns/iter (+/- 0)    20 ns/iter (+/- 0)    3.2
      get_u32::tbuf_1             77 ns/iter (+/- 0)    35 ns/iter (+/- 0)    2.2
      get_u32::tbuf_1_costly      91 ns/iter (+/- 2)    63 ns/iter (+/- 0)    1.4
      get_u32::tbuf_2            151 ns/iter (+/- 40)  157 ns/iter (+/- 0)    1.0
      get_u32::tbuf_2_costly     162 ns/iter (+/- 0)   180 ns/iter (+/- 0)    0.9
      
      get_u64::cursor             67 ns/iter (+/- 0)    20 ns/iter (+/- 0)    3.4
      get_u64::tbuf_1             78 ns/iter (+/- 0)    35 ns/iter (+/- 1)    2.2
      get_u64::tbuf_1_costly      87 ns/iter (+/- 1)    59 ns/iter (+/- 1)    1.5
      get_u64::tbuf_2            154 ns/iter (+/- 0)   160 ns/iter (+/- 0)    1.0
      get_u64::tbuf_2_costly     168 ns/iter (+/- 0)   184 ns/iter (+/- 0)    0.9
      
      get_u8::cursor              64 ns/iter (+/- 0)    19 ns/iter (+/- 0)    3.4
      get_u8::tbuf_1              77 ns/iter (+/- 0)    35 ns/iter (+/- 0)    2.2
      get_u8::tbuf_1_costly       68 ns/iter (+/- 0)    51 ns/iter (+/- 0)    1.3
      get_u8::tbuf_2              85 ns/iter (+/- 0)    43 ns/iter (+/- 0)    2.0
      get_u8::tbuf_2_costly       75 ns/iter (+/- 0)    61 ns/iter (+/- 0)    1.2
      get_u8::option              77 ns/iter (+/- 0)    59 ns/iter (+/- 0)    1.3
      
      The improvement on the basic std::Cursor implementation is clearly visible.
      
      Other implementations are specific to the bench tests and just wrap a static
      slice. The different variants are:
       - tbuf_1: only one call to 'bytes()' is needed.
       - tbuf_2: two calls to 'bytes()' are needed to read more than one byte.
       - _costly variants are implemented with #[inline(never)] on 'bytes()',
         'remaining()' and 'advance()'.
      
      The cases that are slightly slower correspond to implementations that are
      not very realistic: they never expose more than one byte at a time.
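
      A minimal sketch of this fast-path/fallback pattern (a hypothetical free
      function written against the 0.4-era Buf API, where the current chunk is
      exposed by bytes(); this is not the crate's actual code):

      ```
      use std::mem;
      use bytes::Buf;

      // Sketch only: read a big-endian u32, taking it straight from the current
      // chunk when possible and falling back to a temporary buffer otherwise.
      fn get_u32_be<B: Buf>(buf: &mut B) -> u32 {
          const N: usize = mem::size_of::<u32>();
          let mut raw = [0u8; N];
          if buf.bytes().len() >= N {
              // Fast path: the whole value is available in the current chunk.
              raw.copy_from_slice(&buf.bytes()[..N]);
              buf.advance(N);
          } else {
              // Slow path: the value straddles chunk boundaries, so copy it out
              // through a temporary buffer (the previous code path).
              buf.copy_to_slice(&mut raw);
          }
          u32::from_be_bytes(raw)
      }
      ```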
    • Improve performance of Buf::get_*() (#195) · e4447220
      kohensu authored
  3. Mar 12, 2018
    • Fix `copy_to_slice` to use correct increment var · ebe52273
      Carl Lerche authored
      This patch fixes the `copy_to_slice` function to use the correct increment
      variable. The incorrect code did not result in incorrect behavior: the only
      case where `cnt != src.len()` is the final iteration, and since `src.len()`
      is greater than `cnt` there, `off` is incremented by too much, but the
      `off < dst.len()` loop condition still causes the loop to exit.

      The only danger is that `src.len()` could cause an overflow.
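
      A rough reconstruction of the loop in question (mirroring the description
      above and the 0.4-era Buf method names, not the crate's exact code):

      ```
      use std::{cmp, ptr};
      use bytes::Buf;

      // Sketch of the default copy_to_slice loop: `src` is the current chunk,
      // `cnt` is how many bytes actually get copied this iteration, and the fix
      // is to advance `off` by `cnt` rather than by `src.len()`.
      fn copy_to_slice_sketch<B: Buf>(buf: &mut B, dst: &mut [u8]) {
          assert!(buf.remaining() >= dst.len());
          let mut off = 0;
          while off < dst.len() {
              let cnt;
              unsafe {
                  let src = buf.bytes();
                  cnt = cmp::min(src.len(), dst.len() - off);
                  ptr::copy_nonoverlapping(src.as_ptr(), dst[off..].as_mut_ptr(), cnt);
                  // The bug was `off += src.len()`, which overshoots on the final
                  // iteration when the chunk is larger than what is left to copy.
                  off += cnt;
              }
              buf.advance(cnt);
          }
      }
      ```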
    • Remove ByteOrder generic methods from Buf and BufMut (#187) · 025d5334
      Sean McArthur authored
      * make Buf and BufMut usable as trait objects
      
      - All the `get_*` and `put_*` methods that take `T: ByteOrder` have
        a `where Self: Sized` bound added, so that they are only usable from
        sized types. It was impossible to make `Buf` or `BufMut` into trait
        objects before, so this change doesn't break anyone.
      - Add `get_n_be`/`get_n_le`/`put_n_be`/`put_n_le` methods that can be
        used on trait objects.
      - Deprecate the export of `ByteOrder` and methods generic on it.
      
      * remove deprecated ByteOrder methods
      
      Removes the `_be` suffix from all methods, implying that network endian is
      the default people should use.
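
      A minimal sketch of the object-safety pattern being described (hypothetical
      trait and byte-order stand-in, not the crate's actual definitions): the
      method that is generic over a byte-order type gets a `where Self: Sized`
      bound, while a non-generic variant stays callable through a trait object.

      ```
      // Hypothetical stand-in for the role `byteorder::ByteOrder` played.
      trait ByteOrderLike {
          fn combine(first: u8, second: u8) -> u16;
      }

      trait MyBuf {
          fn get_u8(&mut self) -> u8;

          // Non-generic: still callable through a `dyn MyBuf` trait object.
          fn get_u16_be(&mut self) -> u16 {
              ((self.get_u8() as u16) << 8) | self.get_u8() as u16
          }

          // Generic over the byte order: restricted to sized (concrete) types,
          // which keeps the trait as a whole object-safe.
          fn get_u16<O: ByteOrderLike>(&mut self) -> u16
          where
              Self: Sized,
          {
              O::combine(self.get_u8(), self.get_u8())
          }
      }

      // Compiles only because the generic method is excluded from trait objects.
      fn read_header(buf: &mut dyn MyBuf) -> u16 {
          buf.get_u16_be()
      }
      ```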
    • Make Buf and BufMut usable as trait objects (#186) · ce79f0a2
      Sean McArthur authored
      - All the `get_*` and `put_*` methods that take `T: ByteOrder` have
        a `where Self: Sized` bound added, so that they are only usable from
        sized types. It was impossible to make `Buf` or `BufMut` into trait
        objects before, so this change doesn't break anyone.
      - Add `get_n_be`/`get_n_le`/`put_n_be`/`put_n_le` methods that can be
        used on trait objects.
      - Deprecate the export of `ByteOrder` and methods generic on it.
      
      Fixes #163 
  4. Jan 26, 2018
  5. Jun 27, 2017
  6. May 24, 2017
  7. Apr 30, 2017
  8. Mar 19, 2017
    • Clarify when `BufMut::bytes_mut` can return &[] · bed128b2
      Carl Lerche authored
      Closes #79
    • Add inline attributes to Vec's MutBuf methods (#80) · 5a265cc8
      Dan Burkert authored
      I found this significantly improved a
      [benchmark](https://gist.github.com/danburkert/34a7d6680d97bc86dca7f396eb8d0abf)
      which calls `bytes_mut`, writes 1 byte, and advances the pointer with
      `advance_mut` in a pretty tight loop. In particular, it seems to be the
      inline annotation on `bytes_mut` which had the most effect. I also took
      the opportunity to simplify the bounds checking in advance_mut.
      
      before:
      
      ```
      test encode_varint_small  ... bench:         540 ns/iter (+/- 85) = 1481 MB/s
      ```
      
      after:
      
      ```
      test encode_varint_small  ... bench:         422 ns/iter (+/- 24) = 1895 MB/s
      ```
      
      As you can see, the variance is also significantly improved.
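
      For reference, a hypothetical loop with the same shape as that benchmark
      (assuming the 0.4-era BufMut API, where `bytes_mut()` exposes spare
      capacity as a byte slice and `advance_mut` commits the write; this is not
      the benchmark's actual code):

      ```
      use bytes::BufMut;

      // Sketch only: write one byte per iteration through bytes_mut/advance_mut,
      // the pattern the inline annotations speed up.
      fn fill<B: BufMut>(buf: &mut B, n: usize) {
          assert!(buf.remaining_mut() >= n);
          for i in 0..n {
              unsafe {
                  // Write a single byte into the spare capacity...
                  buf.bytes_mut()[0] = i as u8;
                  // ...then advance the write cursor past it.
                  buf.advance_mut(1);
              }
          }
      }
      ```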
      
      Interestingly, I tried to change the last statement in `bytes_mut` from
      
      ```
      &mut slice::from_raw_parts_mut(ptr, cap)[len..]
      ```
      
      to
      
      ```
      slice::from_raw_parts_mut(ptr.offset(len as isize), cap - len)
      ```
      
      but this caused a very measurable perf regression (almost completely
      negating the gains from marking bytes_mut inline).
    • Clarify BufMut::advance_mut docs (#78) · 4fe4e942
      Dan Burkert authored
      Also fixes an issue with a line wrap in the middle of an inline code
      block.
  9. Mar 16, 2017
  10. Mar 07, 2017
    • Remove buf::Source in favor of buf::IntoBuf · 06b94c55
      Carl Lerche authored
      The `Source` trait was essentially covering the same case as `IntoBuf`,
      so remove it.
      
      While technically a breaking change, this should not have any impact due
      to:
      
      1) There are no reverse dependencies that currently depend on `bytes`
      2) Source was not supposed to be implemented externally
      3) IntoBuf provides the same implementations as `Source`
      
      Given these points, the change should be safe to apply.
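
      For context, a small sketch of how `IntoBuf` is used (hedged against the
      0.4-era API, where `into_buf()` converts slices, `Bytes`, and friends into
      a `Buf`; the helper itself is hypothetical):

      ```
      use bytes::{Buf, IntoBuf};

      // Hypothetical helper: accept anything convertible into a Buf, covering
      // the same ground the removed `Source` trait did.
      fn first_byte<T: IntoBuf>(data: T) -> Option<u8> {
          let mut buf = data.into_buf();
          if buf.has_remaining() {
              Some(buf.get_u8())
          } else {
              None
          }
      }

      // Usage: first_byte(&b"abc"[..]) == Some(b'a')
      ```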
    • Provide Debug impls for all types · d70f575a
      Carl Lerche authored
  11. Mar 02, 2017
  12. Mar 01, 2017
  13. Feb 28, 2017
  14. Feb 17, 2017
  15. Feb 16, 2017
  16. Feb 15, 2017
  17. Feb 03, 2017
  18. Nov 22, 2016
  19. Nov 21, 2016
  20. Nov 03, 2016
  21. Nov 02, 2016
    • Remove default for SliceBuf<T> · 11fe277c
      Carl Lerche authored
    • Restructure and trim down the library · 57e84f26
      Carl Lerche authored
      This commit is a significant overhaul of the library in an effort to head
      towards a stable API. The rope implementation as well as a number of buffer
      implementations have been removed from the library and will live at
      https://github.com/carllerche/bytes-more while they incubate.
      
      **Bytes / BytesMut**
      
      `Bytes` is now an atomically ref counted byte slice. As it is contiguous, it
      offers a richer API than before.
      
      `BytesMut` is a mutable variant. It is safe by ensuring that it is the only
      handle to a given byte slice.
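
      A small usage sketch of the relationship between the two types (written
      against the crate's later, stable API; method names at the time of this
      commit may have differed slightly):

      ```
      use bytes::{BufMut, BytesMut};

      fn demo() {
          // BytesMut: the unique, writable handle.
          let mut buf = BytesMut::with_capacity(64);
          buf.put_slice(b"hello world");

          // freeze() turns it into Bytes: immutable and atomically ref counted.
          let bytes = buf.freeze();
          let hello = bytes.slice(0..5); // shares the same underlying storage
          let copy = bytes.clone();      // cheap: bumps a reference count

          assert_eq!(&hello[..], b"hello");
          assert_eq!(&copy[..], b"hello world");
      }
      ```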
      
      **AppendBuf -> ByteBuf**
      
      `AppendBuf` has been replaced by `ByteBuf`. The API is not identical, but is
      close enough to be considered a suitable replacement.
      
      **Removed types**
      
      The following types have been removed in favor of living in bytes-more
      
      * RingBuf
      * BlockBuf
      * `Bytes` as a rope implementation
      * ReadExt
      * WriteExt
  22. Sep 23, 2016
  23. Sep 22, 2016
  24. Sep 20, 2016