StripedHyena Hessian 7B (base)

This is the base model variant of the StripedHyena series, developed by Together.

StripedHyena uses a new architecture that competes with traditional Transformers, particularly in long-context data processing. It combines attention mechanisms with gated convolutions for improved speed, efficiency, and scaling. This model marks an advancement in AI architecture for sequence modeling tasks.

Model Information

Model ID

togethercomputer/stripedhyena-hessian-7b

Context Length

32,768 tokens

Author

togethercomputer

Capabilities