  1. Jun 18, 2018
  2. May 25, 2018
  3. May 24, 2018
    • Recycle space when reserving from Vec-backed Bytes (#197) · dfce95b8
      Noah Zentzis authored
      * Recycle space when reserving from Vec-backed Bytes
      
      BytesMut::reserve, when called on a BytesMut instance which is backed by
      a non-shared Vec<u8>, would previously just delegate to Vec::reserve
      regardless of the current location in the buffer. If the Bytes is
      actually the trailing component of a larger Vec, then the unused space
      won't be recycled. In applications which continually move the pointer
      forward to consume data as it comes in, this can cause the underlying
      buffer to get extremely large.
      
      This commit checks whether there's extra space at the start of the
      backing Vec in this case, and reuses the unused space if possible
      instead of allocating.
      
      * Avoid excessive copying when reusing Vec space
      
      Only reuse space in a Vec-backed Bytes when doing so would gain back
      more than half of the current capacity. This avoids excessive copy
      operations when a large buffer is almost (but not completely) full.
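      
      Below is a minimal sketch of the heuristic described above, modeling a
      Vec-backed buffer as a `Vec<u8>` plus a consumed-prefix `offset`; the type
      and method names are illustrative, not the crate's internals:
      
      ```rust
      /// Illustrative model of a Vec-backed buffer: `vec[offset..]` holds the
      /// live bytes, `vec[..offset]` is an already-consumed prefix that can
      /// potentially be recycled.
      struct VecBuf {
          vec: Vec<u8>,
          offset: usize,
      }
      
      impl VecBuf {
          fn reserve_recycling(&mut self, additional: usize) {
              // Reclaim the consumed prefix only when that wins back more than
              // half of the current capacity; otherwise the copy would cost
              // more than the allocation it saves.
              if self.offset > self.vec.capacity() / 2 {
                  let live = self.vec.len() - self.offset;
                  self.vec.copy_within(self.offset.., 0); // shift live bytes to the front
                  self.vec.truncate(live);
                  self.offset = 0;
              }
              // No-op if the (possibly reclaimed) headroom already suffices.
              self.vec.reserve(additional);
          }
      }
      
      fn main() {
          let mut buf = VecBuf { vec: vec![0u8; 4096], offset: 3000 };
          buf.reserve_recycling(1024);
          // The 3000-byte consumed prefix was recycled instead of reallocating.
          assert_eq!((buf.offset, buf.vec.len()), (0, 1096));
      }
      ```
      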
    • Recycle space when reserving from Vec-backed Bytes (#197) · 2d95683b
      Noah Zentzis authored
  4. May 11, 2018
  5. Apr 27, 2018
    • Bump version to v0.4.7 · ef09e98f
      Carl Lerche authored
    • Merge branch 'v0.4.x' · d656d371
      Carl Lerche authored
    • Improve performance of Buf::get_*() (#195) · 51e435b7
      kohensu authored
      The new implementation tries to read the data directly from bytes() (which
      is possible most of the time); if bytes() does not hold enough data, it
      falls back to the previous code path: copying the needed bytes into a
      temporary buffer before returning the data.
      
      Here are the bench results:
                                     Before                After           x-faster
      get_f32::cursor             64 ns/iter (+/- 0)    20 ns/iter (+/- 0)    3.2
      get_f32::tbuf_1             77 ns/iter (+/- 1)    34 ns/iter (+/- 0)    2.3
      get_f32::tbuf_1_costly      87 ns/iter (+/- 0)    62 ns/iter (+/- 0)    1.4
      get_f32::tbuf_2            151 ns/iter (+/- 18)  160 ns/iter (+/- 1)    0.9
      get_f32::tbuf_2_costly     180 ns/iter (+/- 2)   187 ns/iter (+/- 2)    1.0
      
      get_f64::cursor             67 ns/iter (+/- 0)    21 ns/iter (+/- 0)    3.2
      get_f64::tbuf_1             80 ns/iter (+/- 0)    35 ns/iter (+/- 0)    2.3
      get_f64::tbuf_1_costly      82 ns/iter (+/- 3)    60 ns/iter (+/- 0)    1.4
      get_f64::tbuf_2            154 ns/iter (+/- 1)   164 ns/iter (+/- 0)    0.9
      get_f64::tbuf_2_costly     170 ns/iter (+/- 2)   187 ns/iter (+/- 1)    0.9
      
      get_u16::cursor             66 ns/iter (+/- 0)    20 ns/iter (+/- 0)    3.3
      get_u16::tbuf_1             77 ns/iter (+/- 0)    35 ns/iter (+/- 0)    2.2
      get_u16::tbuf_1_costly      85 ns/iter (+/- 2)    62 ns/iter (+/- 0)    1.4
      get_u16::tbuf_2            147 ns/iter (+/- 0)   154 ns/iter (+/- 0)    1.0
      get_u16::tbuf_2_costly     160 ns/iter (+/- 1)   177 ns/iter (+/- 0)    0.9
      
      get_u32::cursor             64 ns/iter (+/- 0)    20 ns/iter (+/- 0)    3.2
      get_u32::tbuf_1             77 ns/iter (+/- 0)    35 ns/iter (+/- 0)    2.2
      get_u32::tbuf_1_costly      91 ns/iter (+/- 2)    63 ns/iter (+/- 0)    1.4
      get_u32::tbuf_2            151 ns/iter (+/- 40)  157 ns/iter (+/- 0)    1.0
      get_u32::tbuf_2_costly     162 ns/iter (+/- 0)   180 ns/iter (+/- 0)    0.9
      
      get_u64::cursor             67 ns/iter (+/- 0)    20 ns/iter (+/- 0)    3.4
      get_u64::tbuf_1             78 ns/iter (+/- 0)    35 ns/iter (+/- 1)    2.2
      get_u64::tbuf_1_costly      87 ns/iter (+/- 1)    59 ns/iter (+/- 1)    1.5
      get_u64::tbuf_2            154 ns/iter (+/- 0)   160 ns/iter (+/- 0)    1.0
      get_u64::tbuf_2_costly     168 ns/iter (+/- 0)   184 ns/iter (+/- 0)    0.9
      
      get_u8::cursor              64 ns/iter (+/- 0)    19 ns/iter (+/- 0)    3.4
      get_u8::tbuf_1              77 ns/iter (+/- 0)    35 ns/iter (+/- 0)    2.2
      get_u8::tbuf_1_costly       68 ns/iter (+/- 0)    51 ns/iter (+/- 0)    1.3
      get_u8::tbuf_2              85 ns/iter (+/- 0)    43 ns/iter (+/- 0)    2.0
      get_u8::tbuf_2_costly       75 ns/iter (+/- 0)    61 ns/iter (+/- 0)    1.2
      get_u8::option              77 ns/iter (+/- 0)    59 ns/iter (+/- 0)    1.3
      
      The improvement over the basic std::Cursor implementation is clearly visible.
      
      The other implementations are specific to the bench tests and simply wrap a
      static slice. The different variants are:
       - tbuf_1: only one call to 'bytes()' is needed.
       - tbuf_2: two calls to 'bytes()' are needed to read more than one byte.
       - the _costly versions are implemented with #[inline(never)] on 'bytes()',
         'remaining()' and 'advance()'.
      
      The slightly slower cases correspond to implementations that are not very
      realistic: they never yield more than one byte at a time.
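      
      A hedged sketch of the fast path described above, written against a
      stand-in `MiniBuf` trait that mirrors the 0.4-era `bytes()`/`remaining()`/
      `advance()` surface (the stand-in trait and names are assumptions, not the
      crate's code):
      
      ```rust
      use std::convert::TryInto;
      
      /// Stand-in for the 0.4-era `Buf` surface (illustrative only).
      trait MiniBuf {
          fn remaining(&self) -> usize;
          fn bytes(&self) -> &[u8];
          fn advance(&mut self, cnt: usize);
      }
      
      /// A single contiguous slice trivially satisfies the stand-in trait.
      impl MiniBuf for &[u8] {
          fn remaining(&self) -> usize { self.len() }
          fn bytes(&self) -> &[u8] { self }
          fn advance(&mut self, cnt: usize) { *self = &self[cnt..]; }
      }
      
      fn get_u32_be<B: MiniBuf>(buf: &mut B) -> u32 {
          let chunk = buf.bytes();
          if chunk.len() >= 4 {
              // Fast path: the current chunk already holds all four bytes.
              let val = u32::from_be_bytes(chunk[..4].try_into().unwrap());
              buf.advance(4);
              val
          } else {
              // Slow path (the previous code): gather the bytes chunk by
              // chunk into a temporary buffer before decoding.
              assert!(buf.remaining() >= 4);
              let mut tmp = [0u8; 4];
              let mut off = 0;
              while off < 4 {
                  let src = buf.bytes();
                  let cnt = usize::min(src.len(), 4 - off);
                  tmp[off..off + cnt].copy_from_slice(&src[..cnt]);
                  buf.advance(cnt);
                  off += cnt;
              }
              u32::from_be_bytes(tmp)
          }
      }
      
      fn main() {
          let mut data: &[u8] = &[0xDE, 0xAD, 0xBE, 0xEF, 0x01];
          assert_eq!(get_u32_be(&mut data), 0xDEAD_BEEF);
          assert_eq!(data, &[0x01u8][..]);
      }
      ```
      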
    • impl BorrowMut for BytesMut (#185) (#192) · 15050b1d
      Alan Somers authored
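      
      With this impl in place, `BytesMut` can be handed to code that is generic
      over `BorrowMut<[u8]>`; a small usage sketch (the helper function is
      hypothetical):
      
      ```rust
      use bytes::BytesMut;
      use std::borrow::BorrowMut;
      
      /// Generic over anything that can hand out a mutable byte slice,
      /// e.g. Vec<u8>, Box<[u8]>, and (with this change) BytesMut.
      fn zero_first_byte<T: BorrowMut<[u8]>>(storage: &mut T) {
          let slice: &mut [u8] = storage.borrow_mut();
          if let Some(b) = slice.first_mut() {
              *b = 0;
          }
      }
      
      fn main() {
          let mut buf = BytesMut::from(&b"hello"[..]);
          zero_first_byte(&mut buf);
          assert_eq!(&buf[..], &b"\0ello"[..]);
      }
      ```
      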
    • Improve performance of Buf::get_*() (#195) · e4447220
      kohensu authored
    • Add a bench for Buf::get_*() (#194) · d5610062
      kohensu authored
  6. Mar 12, 2018
    • Introduce Bytes::to_mut (#188) · 2c27ddaf
      Anthony Ramine authored
    • impl BorrowMut for BytesMut (#185) · ae1b4549
      Alan Somers authored
    • Fix `copy_to_slice` to use correct increment var · ebe52273
      Carl Lerche authored
      This patch fixes the `copy_to_slice` function, rectifying the loop logic.
      However, the incorrect code did not result in incorrect behavior: the only
      case where `cnt != src.len()` is the final iteration, and since `src.len()`
      is greater than `cnt` in that case, `off` was incremented by too much, but
      the `off < dst.len()` loop condition still terminated correctly.
      
      The only real danger was that adding `src.len()` could cause an overflow.
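      
      A minimal sketch of the corrected loop, using the same kind of stand-in
      `MiniBuf` trait as in the earlier sketch (illustrative, not the crate's
      actual default implementation):
      
      ```rust
      /// Stand-in for the `Buf` surface `copy_to_slice` relies on (illustrative).
      trait MiniBuf {
          fn remaining(&self) -> usize;
          fn bytes(&self) -> &[u8];
          fn advance(&mut self, cnt: usize);
      }
      
      fn copy_to_slice<B: MiniBuf>(buf: &mut B, dst: &mut [u8]) {
          assert!(buf.remaining() >= dst.len());
          let mut off = 0;
          while off < dst.len() {
              let src = buf.bytes();
              let cnt = usize::min(src.len(), dst.len() - off);
              dst[off..off + cnt].copy_from_slice(&src[..cnt]);
              off += cnt; // the fix: increment by `cnt`, not by `src.len()`
              buf.advance(cnt);
          }
      }
      ```
      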
    • bd4630a3
      Carl Lerche authored
    • Remove ByteOrder generic methods from Buf and BufMut (#187) · 025d5334
      Sean McArthur authored
      * make Buf and BufMut usable as trait objects
      
      - All the `get_*` and `put_*` methods that take `T: ByteOrder` have
        a `where Self: Sized` bound added, so that they are only usable from
        sized types. It was impossible to make `Buf` or `BufMut` into trait
        objects before, so this change doesn't break anyone.
      - Add `get_n_be`/`get_n_le`/`put_n_be`/`put_n_le` methods that can be
        used on trait objects.
      - Deprecate the export of `ByteOrder` and methods generic on it.
      
      * remove deprecated ByteOrder methods
      
      Removes the `_be` suffix from all methods, making network byte order
      (big-endian) the implied default.
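      
      The underlying object-safety trick is plain Rust: a generic method carrying
      a `where Self: Sized` bound is left out of the trait's vtable, so the trait
      remains usable as a trait object. A standalone illustration with made-up
      trait and type names:
      
      ```rust
      trait Encode {
          /// A generic method would normally make the trait not object safe;
          /// the `where Self: Sized` bound excludes it from `dyn Encode`, so
          /// the trait can still be used as a trait object.
          fn put_generic<T: Into<u32>>(&mut self, val: T)
          where
              Self: Sized,
          {
              self.put_u32(val.into());
          }
      
          /// Non-generic methods remain callable through `dyn Encode`.
          fn put_u32(&mut self, val: u32);
      }
      
      struct Sink(Vec<u8>);
      
      impl Encode for Sink {
          fn put_u32(&mut self, val: u32) {
              self.0.extend_from_slice(&val.to_be_bytes());
          }
      }
      
      fn main() {
          let mut sink = Sink(Vec::new());
          let obj: &mut dyn Encode = &mut sink; // allowed thanks to the Sized bound
          obj.put_u32(42);
          assert_eq!(sink.0, [0, 0, 0, 42]);
      }
      ```
      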
    • Make Buf and BufMut usable as trait objects (#186) · ce79f0a2
      Sean McArthur authored
      Fixes #163
  7. Feb 26, 2018
  8. Jan 29, 2018
  9. Jan 26, 2018
  10. Jan 15, 2018
  11. Jan 08, 2018
  12. Jan 06, 2018
  13. Jan 03, 2018
    • Optimize shallow_clone for Bytes::split_{off,to} (#92) · 6a3d20bb
      Stepan Koltsov authored
      If `shallow_clone` is called with `&mut self`, and the `Bytes` contains a
      `Vec`, then the expensive CAS can be avoided, because no other thread holds
      a reference to this `Bytes` object.
      
      Bench `split_off_and_drop` difference:
      
      Before the diff:
      
      ```
      test split_off_and_drop             ... bench:      91,858 ns/iter (+/- 17,401)
      ```
      
      With the diff:
      
      ```
      test split_off_and_drop             ... bench:      81,162 ns/iter (+/- 17,603)
      ```
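      
      A hedged illustration of the general technique (the types and layout are
      invented, not the crate's internals): with `&mut self` the caller proves
      exclusive access, so the reference count can be touched without an atomic
      RMW, via `AtomicUsize::get_mut`:
      
      ```rust
      use std::sync::atomic::{AtomicUsize, Ordering};
      
      struct Shared {
          refcount: AtomicUsize,
          // ... the buffer storage itself would live here
      }
      
      impl Shared {
          /// Shared access: another thread may race, so an atomic RMW is required.
          fn incr_shared(&self) {
              self.refcount.fetch_add(1, Ordering::Relaxed);
          }
      
          /// Exclusive access: `&mut self` proves no other thread can observe
          /// the count, so a plain read-modify-write through `get_mut` suffices.
          fn incr_unique(&mut self) {
              *self.refcount.get_mut() += 1;
          }
      }
      
      fn main() {
          let mut s = Shared { refcount: AtomicUsize::new(1) };
          s.incr_unique(); // no CAS / atomic RMW on this path
          s.incr_shared();
          assert_eq!(s.refcount.load(Ordering::Relaxed), 3);
      }
      ```
      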
    • Add support for unsplit() to BytesMut (#162) · 2ca61d88
      jq-rs authored
      Add support for unsplit() to BytesMut, which recombines split contiguous
      memory blocks efficiently.
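      
      A short usage sketch: when the two halves still view adjacent regions of
      the same allocation, `unsplit` rejoins them without copying:
      
      ```rust
      use bytes::BytesMut;
      
      fn main() {
          let mut buf = BytesMut::from(&b"hello world"[..]);
      
          // `split_off` leaves `buf` and `tail` viewing adjacent regions of
          // the same underlying allocation.
          let tail = buf.split_off(5);
          assert_eq!(&buf[..], &b"hello"[..]);
          assert_eq!(&tail[..], &b" world"[..]);
      
          // Because the blocks are still contiguous, rejoining needs no copy.
          buf.unsplit(tail);
          assert_eq!(&buf[..], &b"hello world"[..]);
      }
      ```
      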
  14. Dec 16, 2017
  15. Dec 13, 2017
    • Add `advance` on `Bytes` and `BytesMut` (#166) · 02891144
      Carl Lerche authored
      * Compact Bytes original capacity representation
      
      In order to avoid unnecessary allocations, a `Bytes` structure remembers
      the capacity with which it was first created. When a reserve operation
      is issued, this original capacity value is used as a baseline for
      reallocating new storage.
      
      Previously, this original capacity value was stored in its raw form. In
      other words, the original capacity `usize` was stored as is. In order to
      reclaim some `Bytes` internal storage space for additional features,
      this original capacity value is compressed from requiring 16 bits to 3.
      
      To do this, instead of storing the exact original capacity, the value is
      rounded down to the nearest power of two; if the original capacity is less
      than 1024, it is rounded down to zero. This roughly means that the original
      capacity is now stored as a table:
      
      0 => 0
      1 => 1k
      2 => 2k
      3 => 4k
      4 => 8k
      5 => 16k
      6 => 32k
      7 => 64k
      
      For the purposes that the original capacity feature was introduced, this
      is sufficient granularity.
      
      * Provide `advance` on Bytes and BytesMut
      
      This is the `advance` function that would be part of a `Buf`
      implementation. However, `Bytes` and `BytesMut` cannot impl `Buf` until
      the next breaking release.
      
      The implementation uses the additional storage made available by the
      previous commit to store the number of bytes that the view was advanced.
      The `ptr` pointer will point to the start of the window, avoiding any
      pointer arithmetic when dereferencing the `Bytes` handle.
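      
      A hedged sketch of the 3-bit encoding described above (the function names
      are illustrative; the actual bit packing lives in the crate's internals):
      
      ```rust
      /// Compress an original capacity into 3 bits, per the table above:
      /// values below 1k map to 0; otherwise round down to a power of two,
      /// clamped to 64k.
      fn encode_original_capacity(cap: usize) -> u8 {
          if cap < 1024 {
              return 0;
          }
          let pow = usize::BITS - 1 - cap.leading_zeros(); // floor(log2(cap))
          (pow as u8 - 9).min(7) // 1k -> 1, 2k -> 2, ..., >= 64k -> 7
      }
      
      fn decode_original_capacity(code: u8) -> usize {
          if code == 0 { 0 } else { 1 << (code as u32 + 9) }
      }
      
      fn main() {
          assert_eq!(encode_original_capacity(512), 0);
          assert_eq!(encode_original_capacity(1024), 1);
          assert_eq!(encode_original_capacity(3000), 2); // rounded down to 2k
          assert_eq!(encode_original_capacity(1 << 20), 7); // clamped to 64k
          assert_eq!(decode_original_capacity(3), 4096);
      }
      ```
      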
  16. Oct 21, 2017
  17. Aug 18, 2017
    • small fixups in bytes.rs (#145) · 03d501b1
      Dan Burkert authored
      * Inner: make uninitialized construction explicit
      * Remove Inner2
      * Remove unnecessary transmutes
      * Use AtomicPtr::get_mut where possible (see the sketch below)
      * Some minor tweaks
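      
      For the `AtomicPtr::get_mut` point, the same exclusive-access idea as in
      the shallow_clone sketch applies; a tiny standalone illustration (toy
      values, not the crate's code):
      
      ```rust
      use std::sync::atomic::{AtomicPtr, Ordering};
      
      fn main() {
          let mut slot = AtomicPtr::new(Box::into_raw(Box::new(41u32)));
      
          // With exclusive access to the AtomicPtr, `get_mut` yields the raw
          // pointer directly, with no atomic load.
          let ptr: &mut *mut u32 = slot.get_mut();
          unsafe { **ptr += 1 };
      
          // Reclaim the allocation to avoid leaking the Box.
          let boxed = unsafe { Box::from_raw(slot.load(Ordering::Relaxed)) };
          assert_eq!(*boxed, 42);
      }
      ```
      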
  18. Aug 17, 2017