hi @SergioEanX !!
Welcome to our community
Check this recipe as it shows how to use Langchain to ingest some pdfs:
https://github.com/weaviate/recipes/tree/main/integrations/langchain/loading-data
While you may not use Langchain entirely, it will give you some hints on how to use the unstructured. That recipe specifically doesn’t use unstructured, but there is a lot of docs covering this, like here:
Also, you can not only load a single pdf, but an entire folder of contents, like in here:
Let me know if this helps.
Also, check this Academy course we have on chunking, as this is not a “one size fits all”, and some changes can be done for each use case to improve the overall quality of your results:
Let me know if this helps!
Thanks!