Am I Correct That Each Text Chunk (Per Document) Is Processed Sequentially in Graph Creation? #1615
Unanswered
darien-schettler
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi there, I'm doing a deep dive into the code base, and I noticed that while the documents (each row of the DF) are processed in parallel, the actual texts are processed sequentially. This obviously could be by design. The graph creation and processing of the documents is already highly parallel... or perhaps we don't want race conditions for node/relationship creation (i.e. We WANT to go one by one so the graph is created in in the same way that a human would reading front to back)?
Anyway, I just wanted to explicitly call it out and ask. Thanks in advance!
Currently, the code below will process each document sequentially.
I have include the full code for the
__call__
method from theGraphExtractor
class below.ps: Awesome library! Keep up the good work.
Beta Was this translation helpful? Give feedback.
All reactions