January 20, 2025
Handling Long Documents Made Easy
Many current text embedding models, such as those based on BERT, can process only 512 tokens at a time, which limits their effectiveness on long documents. Longer inputs have to be truncated or split into chunks, which often costs context and nuance. Jina Embeddings v2 addresses this by supporting sequences of up to 8192 tokens, preserving context and enhancing […]
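
To make the difference between the two limits concrete, here is a minimal sketch that counts how many separate windows a document has to be split into under each one. It assumes a naive whitespace split as a stand-in for a real tokenizer (actual subword tokenizers usually produce more tokens than words), and `split_into_windows` is a hypothetical helper written for this illustration, not part of any library.

```python
from typing import List


def split_into_windows(tokens: List[str], max_tokens: int) -> List[List[str]]:
    """Split a token list into consecutive, non-overlapping windows.

    Hypothetical helper for illustration only. Each window holds at most
    `max_tokens` tokens; the last window may be shorter.
    """
    if max_tokens <= 0:
        raise ValueError("max_tokens must be positive")
    return [tokens[i:i + max_tokens] for i in range(0, len(tokens), max_tokens)]


# Assumption: whitespace splitting approximates tokenization.
document = "word " * 6000
tokens = document.split()

windows_512 = split_into_windows(tokens, 512)    # 512-token limit
windows_8192 = split_into_windows(tokens, 8192)  # 8192-token limit

print(len(windows_512))   # 12 windows, each embedded without its neighbours
print(len(windows_8192))  # 1 window, the whole document fits at once
```

With the smaller limit, the 6000-token document is broken into 12 pieces that are embedded independently, so no single embedding sees the whole text. With the larger limit, the document fits in one window.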