This is the chat model variant of the StripedHyena series developed by Together in collaboration with Nous Research.

StripedHyena uses a new architecture that competes with traditional Transformers, particularly in long-context data processing. It combines attention mechanisms with gated convolutions for improved speed, efficiency, and scaling. This model marks a significant advancement in AI architecture for sequence modeling tasks.

Model Information

Model ID

togethercomputer/stripedhyena-nous-7b

Context Length

32,768 tokens

Author

togethercomputer

Capabilities