Using the types TransferDescriptor in typescript #348

CMCDragonkai · 2021-03-26T05:18:04Z

I have a worker function like:

  encryptWithPublicKeyAsn1(
    publicKeyAsn1: PublicKeyAsn1,
    plainText: ArrayBuffer
  ): TransferDescriptor<ArrayBuffer> {
    const plainText_ = Buffer.from(plainText);
    const publicKey = keysUtils.publicKeyFromAsn1(publicKeyAsn1);
    const cipherText = keysUtils.encryptWithPublicKey(publicKey, plainText_);
    return Transfer(cipherText.buffer);
  },

However when I call this function from the main side:

      cipherText = await this.workerManager.call(
        async w => {
          const publicKeyAsn1 = keysUtils.publicKeyToAsn1(publicKey);
          return Buffer.from(
            await w.encryptWithPublicKeyAsn1(
              publicKeyAsn1,
              Transfer(plainText.buffer)
            )
          );
        }
      );

There's a type error:

No overload matches this call.
  The last overload gave the following error.
    Argument of type 'TransferDescriptor<ArrayBuffer>' is not assignable to parameter of type 'WithImplicitCoercion<string> | { [Symbol.toPrimitive](hint: "string"): string; }'.
      Property '[Symbol.toPrimitive]' is missing in type 'TransferDescriptor<ArrayBuffer>' but required in type '{ [Symbol.toPrimitive](hint: "string"): string; }'.

The problem appears that the return type is a TransferDescriptor<ArrayBuffer>.

Then when I call it, it must also be ArrayBuffer.

I have to typecast like this:

      cipherText = await this.workerManager.call(
        async w => {
          const publicKeyAsn1 = keysUtils.publicKeyToAsn1(publicKey);
          return Buffer.from(
            await w.encryptWithPublicKeyAsn1(
              publicKeyAsn1,
              Transfer(plainText.buffer) as unknown as ArrayBuffer
            ) as unknown as ArrayBuffer
          );
        }
      );

Is this the right way to do this?

The text was updated successfully, but these errors were encountered:

andywer · 2021-03-27T11:04:59Z

Hey @CMCDragonkai, thanks for reporting! I do think it's an issue with the types.

Workers can return transferables and the main thread will see no difference. The code normalizing the worker's result can be found here.

Now the worker function type defs in src/types/master.ts need to reflect that by applying something like this: type StripTransferDescriptor<T> = T extends TransferDescriptor<infer BaseT> ? BaseT : T

Give me a little bit of time to prepare a PR :)

CMCDragonkai · 2021-07-15T04:41:13Z

Hey @andywer I got around to trying this again and it still doesn't work.

This is the function I have inside my worker:

  encryptWithKey(
    key: ArrayBuffer,
    plainText: ArrayBuffer
  ): TransferDescriptor<ArrayBuffer> {
    const cipherText = utils.encryptWithKey(
      Buffer.from(key),
      Buffer.from(plainText)
    );
    return Transfer(cipherText.buffer);
  },

Am I supposed to have TransferDescriptor on my parameter types as well?

I'm not able to use Transfer on my node buffers nor my node buffer array buffers.

CMCDragonkai · 2021-07-15T04:44:26Z

Do you have an example that uses Node buffers, and not ArrayBuffer?

CMCDragonkai · 2021-07-15T04:49:06Z

This is how I call it:

  cipherText = await workerManager.call(
    async w => {
      const buf = await w.encryptWithKey(
        Transfer(key.buffer),
        Transfer(plainText.buffer)
      );
      return Buffer.from(buf);
    }
  );

There is still a type error on Transfer(plainText.buffer). It says:

Argument of type 'TransferDescriptor<any>' is not assignable to parameter of type 'ArrayBuffer'.
  Type 'TransferDescriptor<any>' is missing the following properties from type 'ArrayBuffer': byteLength, slice, [Symbol.toStringTag]

It seems that either there's some corruption sending a Node buffer's array buffer over and there's also a type error, in that encryptWithkey has 2 ArrayBuffer parameters, but it only makes the first one a TransferDescriptor in typescript inference.

CMCDragonkai · 2021-07-15T05:24:14Z

Ok I think I understood what happened. Node buffers use a large ArrayBuffer. To properly use them I have to also transfer the offset and length information.

However there is in fact another type error. This happens when there are multiple ArrayBuffer parameters in the worker function.

Typescript complains that using Transfer on the second time doesn't match.

For example:

oneForOne(args_0: ArrayBuffer | TransferDescriptor<ArrayBuffer>, args_1: ArrayBuffer): ObservablePromise<ArrayBuffer>

That's the type inferred in vscode. See how the second parameter is just ArrayBuffer instead of TransferDescriptor?

Now it is possible for me to specify the array buffer variable without using Transfer on the call site. But does this mean it's not zero-copy? I am wary of not using Transfer since the docs say it is needed to ensure that it is in fact zero copied.

For now I have to use //@ts-ignore.

CMCDragonkai · 2021-07-15T05:41:32Z

Ok I have got it working now. Is this right way to do all of this:

  encryptWithKey(
    key: ArrayBuffer,
    keyOffset: number,
    keyLength: number,
    plainText: ArrayBuffer,
    plainTextOffset: number,
    plainTextLength: number
  ): TransferDescriptor<[ArrayBuffer, number, number]> {
    const key_ = Buffer.from(key, keyOffset, keyLength);
    const plainText_ = Buffer.from(plainText, plainTextOffset, plainTextLength);
    const cipherText = utils.encryptWithKey(
      key_,
      plainText_
    );
    return Transfer(
      [
        cipherText.buffer,
        cipherText.byteOffset,
        cipherText.byteLength
      ],
      [
        cipherText.buffer
      ]
    );
  },

And

  cipherText = await workerManager.call(
    async w => {
      const [arrayBuf, arrayBufOffset, arrayBufLength]= await w.encryptWithKey(
        Transfer(key.buffer),
        key.byteOffset,
        key.byteLength,
        // @ts-ignore
        Transfer(plainText.buffer),
        plainText.byteOffset,
        plainText.byteLength
      );
      return Buffer.from(arrayBuf, arrayBufOffset, arrayBufLength);
    }
  );

I'm concerned about the part where I have to use @ts-ignore.

Furthermore the comments say that, if I use Transfer the thread cannot longer use it. But I find that I am able to use key and plainText. Is this a concern? Since I will need to share that key with other threads later.

andywer · 2021-07-16T21:39:34Z

Hey @CMCDragonkai!

Good to see you figured most of it out already. Yes, so the way to invoke functions that use transferable objects is very closely related to the way you would use transferable objects without threads.js – mainly for performance reasons.

There are still a few misconceptions, I think, and the code can also be improved. How about this (haven't tried to run it, but it should convey the idea):

encryptWithKey(
    key: {
        data: ArrayBuffer,
        offset: number,
        length: number,
    },
    plainText: {
        data: ArrayBuffer,
        offset: number,
        length: number,
    }
  ): TransferDescriptor<[ArrayBuffer, number, number]>

and

  cipherText = await workerManager.call(
    async w => {
      const [arrayBuf, arrayBufOffset, arrayBufLength]= await w.encryptWithKey(
        Transfer({ data: key.buffer, offset: key.byteOffset, length: key.byteLength }, key.buffer),
        Transfer({ data: plainText.buffer, offset: plainText.byteOffset, length: plainText.byteLength }, plainText.buffer),
      );
      return Buffer.from(arrayBuf, arrayBufOffset, arrayBufLength);
    }
  );

You could then simplify it further by not crafting a new kind of object that resembles the node.js Buffer, but actually passing the Buffer. To make that work you will probably need to write a serializer/deserializer for Buffers, though, I guess:

import { registerSerializer, SerializerImplementation } from "threads"

interface SerializedBuffer {
  __type: "$$Buffer"
  buffer: ArrayBuffer
  byteOffset: number
  byteLength: number
}

const BufferSerializer: SerializerImplementation = {
  deserialize(thing, defaultHandler) {
    if (thing && thing.__type === "$$Buffer") {
      return Buffer.from((thing as any).buffer, (thing as any).byteOffset, (thing as any).byteLength)
    } else {
      return defaultHandler(thing)
    }
  },
  serialize(thing, defaultHandler) {
    if (thing instanceof Buffer) {
      return {
        __type: "$$Buffer",
        buffer: thing.buffer,
        byteOffset: thing.byteOffset,
        byteLength: thing.byteLength
      }
    } else {
      return defaultHandler(thing)
    }
  }
}

registerSerializer(BufferSerializer)

See https://threads.js.org/usage-advanced#custom-message-serializers for details. I have to admit, the documentation is not so easy to understand for these advanced features…

CMCDragonkai · 2021-07-17T05:37:48Z

@andywer thanks for the advice. However I'm curious as to how to deal with the detachment of the array buffer. If the array buffer from the main thread is detached, how can I re-use the key and plaintext buffers for a subsequent operation on threadsjs?

Furthermore the Node Buffer's ArrayBuffer did not get detached, so I found that it was still a copy.

andywer · 2021-07-17T17:58:53Z

If the array buffer from the main thread is detached, how can I re-use the key and plaintext buffers for a subsequent operation on threadsjs?

That's why it's called transferable objects: You transfer the data from one thread to another instead of copying it. Of course you cannot use it in the source thread anymore, unless the second thread transfers it back after it's done.

Your use case sounds as if you really aim to be able to use the keys in both threads independently. If that's the case then you might actually want to not transfer the data, but have it copied.

Furthermore the Node Buffer's ArrayBuffer did not get detached, so I found that it was still a copy.

Using my code sample?

CMCDragonkai · 2021-07-18T03:37:14Z

Yes the same key is going to be re-used over and over to encrypt different blocks. The key never changes once it is created. So that's why I think it's a good idea for it to do zero copy for each call using the same key. You said it could work if I transfer it back. But that means the source thread reference becomes invalid, and it wouldn't work in concurrent situations where the source thread is sending the key to be used for encryption of different chunks concurrently. I haven't tried with Node Buffer involving your example yet though.

…

On 18 July 2021 3:59:03 am AEST, Andy Wermke ***@***.***> wrote: > If the array buffer from the main thread is detached, how can I re-use the key and plaintext buffers for a subsequent operation on threadsjs? That's why it's called *transferable* objects: You transfer the data from one thread to another instead of copying it. Of course you cannot use it in the source thread anymore, unless the second thread transfers it back after it's done. Your use case sounds as if you really aim to be able to use the keys in both threads independently. If that's the case then you might actually want to not transfer the data, but have it copied. > Furthermore the Node Buffer's ArrayBuffer did not get detached, so I found that it was still a copy. Using my code sample? -- You are receiving this because you were mentioned. Reply to this email directly or view it on GitHub: #348 (comment)

-- Sent from my Android device with K-9 Mail. Please excuse my brevity.

andywer · 2021-07-18T05:06:11Z

So that's why I think it's a good idea for it to do zero copy for each call using the same key.

I get that, but it really comes down to copying vs. moving between threads. If you need it in both threads simultaneously you should copy it.

What would make sense, though, is to copy it once only. So you might want to assign each key an ID or calculate a hash, copy the keys to the other threads once only and then pass their ID/hash on each call instead of passing the whole key.

Can't help but think that this would be much easier to do if #273 was merged, so other threads can call the main thread (to request a key by it ID/hash if the key has not been cached in the thread yet).

andywer · 2021-07-18T05:34:35Z

The other question is: What's the size of those keys? If it's insignificant compared to the size of the data that you want to encrypt, it might not even be worth optimizing… 😉

PS: #273 could also make it possible to pass streams to workers. Might be really valuable if you need to encrypt large amounts of data.

CMCDragonkai · 2021-07-18T06:25:07Z

I might have misinterpreted how the Transfer works. I thought it was going to be a sort of shared memory construct where I can manipulate that memory block in one thread, and have it read in the other thread. But it seems that there has to be this explicit passing around of references... Yea so I could pass the key as a cache, but the keys are just going to be 32 bytes long so it's not a huge deal. But it's not that I have a large buffer to encrypt, it's that I will have MANY small buffers to encrypt and decrypt. And I thought it would be good to have subthreads operate on the shared memory rather copying all the time. Now I still have to check if Node buffers work, because from my preliminary testing it doesn't work. Node array buffers are never detached when I used `Transfer` on them.

…

On 7/18/21 3:06 PM, Andy Wermke wrote: So that's why I think it's a good idea for it to do zero copy for each call using the same key. I get that, but it really comes down to copying vs. moving between threads. If you need it in both threads simultaneously you should copy it. What would make sense, though, is to copy it once only. So you might want to assign each key an ID or calculate a hash, copy the keys to the other threads once only and then pass their ID/hash on each call instead of passing the whole key. Can't help but think that this would be much easier to do if #273 <#273> was merged, so other threads can call the main thread (to request a key by it ID/hash if the key has not been cached in the thread yet). — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#348 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAE4OHNENZ2LFDC5ESFVVP3TYJOM3ANCNFSM4Z2WUT3Q>.

andywer · 2021-07-18T09:23:04Z

I think node.js Buffers should work in principle. However, have you considered just passing the array buffers? They are transferable as a whole and the node.js Buffers are basically just a fancy wrapper around them.

Might be easier to pass the array buffers themselves and if you really need some of the Buffer functionality, you could Buffer.from(arrayBuffer) on the other side.

The Transfer() really transfers ownership. It's gone from the source thread afterwards. There is, however, a shared memory array buffer: See SharedArrayBuffer. Should work with node.

CMCDragonkai · 2021-07-18T13:21:28Z

Oh this might explain why Node Buffer's array buffers never get detached. The docs say that it cannot be detached. And inside Node Buffer's ArrayBuffer it is never detached after Transfer. I can use ArrayBuffer too, but so much of the other code uses Node Buffers. This is being used for transparent filesystem encryption.

…

On 7/18/21 7:23 PM, Andy Wermke wrote: I think node.js Buffers should work in principle. However, have you considered just passing the array buffers? They are transferable as a whole and the node.js Buffers are basically just a fancy wrapper around them. Might be easier to pass the array buffers themselves and if you really need some of the Buffer functionality, you could |Buffer.from(arrayBuffer)| on the other side. The |Transfer()| really /transfers/ ownership. It's gone from the source thread afterwards. There is, however, a shared memory array buffer: See SharedArrayBuffer <https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Global_Objects/SharedArrayBuffer>. Should work with node. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub <#348 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/AAE4OHMQNY5XBVR65M2JJILTYKMQHANCNFSM4Z2WUT3Q>.

CMCDragonkai · 2021-09-07T11:53:36Z

If I used SharedArrayBuffer, would Transfer still be required?

andywer · 2021-09-07T15:05:08Z

I didn't try yet myself, but according to this answer you don't need Transfer(), according to the spec you apparently must not, in fact (that's 5 years old information, though).

CMCDragonkai · 2021-09-17T05:36:35Z

Hi @andywer I think there's still a problem with Transfer types. I remember you added StripTransfer, however I believe this only applies to the first parameter. The second parameter still requires a @ts-ignore.

w.transferBuffers(
  Transfer(x),
  // @ts-ignore: threadsjs type is wrong
  Transfer(y)
);

The second Transfer isn't allowed by typescript inference.

Also I have a question about this situation...

In the actual method signature for the worker modules, we don't use TransferDescriptor on the argument types, why do we have to put it on the return type. Either they should be on both input and output, or neither. I'd argue for neither... but it doesn't work atm.

andywer · 2021-09-17T20:37:51Z

I am not a 100% sure about the specifics anymore, but I will give it a shot…

In the actual method signature for the worker modules, we don't use TransferDescriptor on the argument types, why do we have to put it on the return type.

I don't think you have to type the worker function return type as TransferDescriptor at all. It just ends up there if you return some Transfer(x). That's why the StripTransfer<> util type is then needed in the main thread. The worker function does return a Transfer(x), telling threads.js that x shall be transferred, not copied, but the main thread eventually receives just the plain x, either way.

We would have to do the same thing with the worker function parameters, but there is one good reason why we don't need to: The worker function signature used in the main thread is derived from the worker's types, but we never use any main thread types in the worker, so we only need to do StripTransfer<> one way, not in the other direction.

The second Transfer isn't allowed by typescript inference.

What's the error?

CMCDragonkai · 2021-09-18T06:55:02Z

The error is:

Argument of type 'TransferDescriptor<any>' is not assignable to parameter of type 'ArrayBuffer'.
  Type 'TransferDescriptor<any>' is missing the following properties from type 'ArrayBuffer': byteLength, slice, [Symbol.toStringTag]ts(2345)

The inferred type from w.encrypt is:

(method) encrypt(args_0: ArrayBuffer | TransferDescriptor<ArrayBuffer>, args_1: ArrayBuffer): ObservablePromise<ArrayBuffer>

Do you see how the second parameter doesn't get the TransferDescriptor option?

This is my "worker module"

const dbWorker = {
  async encrypt(
    key: ArrayBuffer,
    plainText: ArrayBuffer,
  ): Promise<TransferDescriptor<ArrayBuffer>> {
    const cipherText = await utils.encrypt(key, plainText);
    return Transfer(cipherText);
  },
  async decrypt(
    key: ArrayBuffer,
    cipherText: ArrayBuffer,
  ): Promise<TransferDescriptor<ArrayBuffer> | undefined> {
    const plainText = await utils.decrypt(key, cipherText);
    if (plainText != null) {
      return Transfer(plainText);
    } else {
      return;
    }
  },
};

CMCDragonkai · 2022-10-12T01:49:00Z

This is still a problem btw, the StripTransfer only solves the problem for the first parameter. Not the second or subsequent parameters. The StripTransfer should be applied to all potential parameters to the worker function.

andywer added the bug label Mar 27, 2021

andywer mentioned this issue Mar 27, 2021

Resolve transferable object types on call #352

Merged

andywer closed this as completed in #352 Apr 18, 2021

CMCDragonkai mentioned this issue Jul 16, 2021

What is the correct way to ensure zero copy of an ArrayBuffer and the ability to transfer back to the main thread (without reassignment) #385

Open

CMCDragonkai mentioned this issue Sep 7, 2021

Integrate Worker Manager into DB for parallel decryption/encryption MatrixAI/Polykey#220

Closed

6 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using the types TransferDescriptor in typescript #348

Using the types TransferDescriptor in typescript #348

CMCDragonkai commented Mar 26, 2021 •

edited

Loading

andywer commented Mar 27, 2021

CMCDragonkai commented Jul 15, 2021

CMCDragonkai commented Jul 15, 2021

CMCDragonkai commented Jul 15, 2021

CMCDragonkai commented Jul 15, 2021

CMCDragonkai commented Jul 15, 2021

andywer commented Jul 16, 2021

CMCDragonkai commented Jul 17, 2021

andywer commented Jul 17, 2021

CMCDragonkai commented Jul 18, 2021 via email

andywer commented Jul 18, 2021

andywer commented Jul 18, 2021

CMCDragonkai commented Jul 18, 2021 via email

andywer commented Jul 18, 2021

CMCDragonkai commented Jul 18, 2021 via email

CMCDragonkai commented Sep 7, 2021

andywer commented Sep 7, 2021

CMCDragonkai commented Sep 17, 2021 •

edited

Loading

andywer commented Sep 17, 2021

CMCDragonkai commented Sep 18, 2021

CMCDragonkai commented Oct 12, 2022

Using the types TransferDescriptor in typescript #348

Using the types TransferDescriptor in typescript #348

Comments

CMCDragonkai commented Mar 26, 2021 • edited Loading

andywer commented Mar 27, 2021

CMCDragonkai commented Jul 15, 2021

CMCDragonkai commented Jul 15, 2021

CMCDragonkai commented Jul 15, 2021

CMCDragonkai commented Jul 15, 2021

CMCDragonkai commented Jul 15, 2021

andywer commented Jul 16, 2021

CMCDragonkai commented Jul 17, 2021

andywer commented Jul 17, 2021

CMCDragonkai commented Jul 18, 2021 via email

andywer commented Jul 18, 2021

andywer commented Jul 18, 2021

CMCDragonkai commented Jul 18, 2021 via email

andywer commented Jul 18, 2021

CMCDragonkai commented Jul 18, 2021 via email

CMCDragonkai commented Sep 7, 2021

andywer commented Sep 7, 2021

CMCDragonkai commented Sep 17, 2021 • edited Loading

andywer commented Sep 17, 2021

CMCDragonkai commented Sep 18, 2021

CMCDragonkai commented Oct 12, 2022

CMCDragonkai commented Mar 26, 2021 •

edited

Loading

CMCDragonkai commented Sep 17, 2021 •

edited

Loading