Why doesn't concatLayer in Deep Learning Toolbox concatenate the 'T' dimension?

Question

John Smith on 13 Mar 2023

Direct link to this question

https://es.mathworks.com/matlabcentral/answers/1927735-why-doesn-t-concatlayer-in-deep-learning-toolbox-concatenate-the-t-dimension

Commented: Artem Lensky on 19 Aug 2023

Hello,

While implementing a ViT (Vision Transformer) in MATLAB, I found that concatLayer does not concatenate over the T dimension. This is needed to concatenate the class token with the patch tokens, since the natural representation is CBT, with C corresponding to features, B to batch, and T to the token within each batch element (this is also the canonical representation in the attention function).

It's possible to work around this by relabeling the data as, e.g., SCB, but that causes other problems that then need workarounds of their own.

Thx

Answer 1

Ben on 14 Mar 2023

Direct link to this answer

https://es.mathworks.com/matlabcentral/answers/1927735-why-doesn-t-concatlayer-in-deep-learning-toolbox-concatenate-the-t-dimension#answer_1192820

You can create a layer that concatenates on the T dimension with functionLayer:

% Dimension 3 is the T dimension of CBT data
sequenceCatLayer = functionLayer(@(x,y) cat(3,x,y));

This works in a dlnetwork to concatenate two CBT dlarrays.
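As a quick sanity check outside of any network, the same anonymous function can be applied to two formatted dlarrays directly. This is only a sketch: the sizes (8 channels, batch of 2, 16 patch tokens) and the variable names are made up for illustration, and it assumes both inputs carry the same 'CBT' format.

```matlab
% Same function handle as passed to functionLayer above
catT = @(x,y) cat(3,x,y);

% Illustrative sizes (assumptions): 8 channels, batch of 2
clsToken = dlarray(rand(8,2,1,"single"), "CBT");   % one class token per observation
patches  = dlarray(rand(8,2,16,"single"), "CBT");  % 16 patch tokens per observation

% Concatenate along dimension 3, which is T for CBT data
tokens = catT(clsToken, patches);

disp(size(tokens))   % the T dimension should now hold 1 + 16 tokens
disp(dims(tokens))   % format of the result
```

Putting the class token first in the cat call places it at position 1 along T, which is the usual ViT convention.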

Since you're concatenating the class token, it might also be worth considering creating a custom layer that has the class token embedding as a Learnable property, and performs the concatenation in the predict method.
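A minimal sketch of such a custom layer might look like the following. The class name classTokenLayer, the property name ClassToken, the default layer name, and the 0.02 initialization scale are all hypothetical choices, not part of the toolbox. The sketch assumes the layer does not inherit from nnet.layer.Formattable, so predict receives the CBT data as an unformatted channels-by-batch-by-tokens array. The code would need to be saved as classTokenLayer.m.

```matlab
classdef classTokenLayer < nnet.layer.Layer
    % Hypothetical layer that prepends a learnable class token along T.

    properties (Learnable)
        ClassToken   % numChannels-by-1 learnable embedding
    end

    methods
        function layer = classTokenLayer(numChannels, name)
            % numChannels must match the C dimension of the input
            if nargin < 2
                name = "cls_token";   % assumed default name
            end
            layer.Name = name;
            layer.Description = "Prepend learnable class token along T";
            % Small random initialization (the scale is an assumption)
            layer.ClassToken = 0.02*randn(numChannels, 1, "single");
        end

        function Z = predict(layer, X)
            % X: numChannels-by-batchSize-by-numTokens
            batchSize = size(X, 2);
            % Replicate the token across the batch dimension
            token = repmat(layer.ClassToken, 1, batchSize, 1);
            % Prepend the token along dimension 3 (T)
            Z = cat(3, token, X);
        end
    end
end
```

With this design the token is trained together with the rest of the network, and the layer needs only one input, so no separate network branch is required to feed the class token into a concatenation layer. Calling predict(classTokenLayer(8), dlarray(rand(8,2,16,"single"))) should return an array with 17 entries along the third dimension.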

3 comments (1 older comment not shown)

Catalytic on 23 Mar 2023

Edited: Catalytic on 23 Mar 2023

@John Smith - Since Ben's answer yielded a solution for you, you should hit the Accept this Answer button, and likewise with other answers you might not have accepted.

Artem Lensky on 19 Aug 2023

Are there any plans to make concatenationLayer support concatenation along the T dimension?
