2024-04-16 Unstructured Data Meetup
A solution to limited context windows
and that's tricky
Turning words into numbers
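To make "turning words into numbers" concrete, here is a minimal sketch. It is not what a real embedding model does: it assumes a toy word-count vector stands in for a learned embedding, and the names `embed`, `cosine_similarity`, and `most_similar` are hypothetical helpers invented for this illustration. The idea it shows is the same, though: text becomes a vector, and "similar meaning" becomes "vectors pointing in a similar direction".

```python
import math
from collections import Counter


def embed(text, vocabulary):
    """Toy stand-in for an embedding: count each vocabulary word in the text."""
    counts = Counter(text.lower().split())
    return [float(counts[word]) for word in vocabulary]


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical mini-corpus, made up for this example
docs = [
    "the cat sat on the mat",
    "dogs chase the cat",
    "stock prices fell sharply today",
]

# The vocabulary fixes which word each vector position refers to
vocabulary = sorted({word for doc in docs for word in doc.split()})
doc_vectors = [embed(doc, vocabulary) for doc in docs]


def most_similar(query):
    """Return the document whose vector is closest in direction to the query's."""
    query_vector = embed(query, vocabulary)
    scores = [cosine_similarity(query_vector, vec) for vec in doc_vectors]
    best = max(range(len(docs)), key=lambda i: scores[i])
    return docs[best]


print(most_similar("stock prices today"))
```

A real embedding model replaces the word counts with dense vectors learned from data, so that texts with related meanings land close together even when they share no words. The nearest-vector lookup stays conceptually the same.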
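from llama_index.core import VectorStoreIndex, SimpleDirectoryReader
# Load every file in ./data, embed it, and build an in-memory vector index
documents = SimpleDirectoryReader("./data").load_data()
index = VectorStoreIndex.from_documents(documents)
# Retrieve the most relevant chunks and hand them to the LLM with the question
query_engine = index.as_query_engine()
response = query_engine.query("What's up?")
print(response)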
part of LlamaCloud
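from llama_parse import LlamaParse
from llama_index.core import SimpleDirectoryReader
# LlamaParse needs a LlamaCloud API key (read from LLAMA_CLOUD_API_KEY unless passed as api_key)
parser = LlamaParse(
    result_type="markdown"  # return parsed output as markdown rather than plain text
)
# Route every PDF found in ./data through LlamaParse
file_extractor = {".pdf": parser}
reader = SimpleDirectoryReader(
    "./data",
    file_extractor=file_extractor
)
documents = reader.load_data()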
...plus everything on Hugging Face!
– Shawn "swyx" Wang, Latent.Space podcast
Follow me on Twitter: @seldo