5bcad9eef3
Created new python processors for text embeddings, inserting into Chroma, querying Chroma, querying ChatGPT, inserting into and querying Pinecone. Fixed some bugs in the Python framework. Added Python extensions to assembly. Also added ability to load dependencies from a requirements.txt as that was important for making the different vectorstore implementations play more nicely together. Excluded nifi-python-extensions-bundle from GitHub build because it requires Maven to use unpack-resources goal, which will not work in GitHub because it uses mvn compile instead of mvn install - ParseDocument - ChunkDocument - PromptChatGPT - PutChroma - PutPinecone - QueryChroma - QueryPinecone NIFI-12195 Added support for requirements.txt to define Python dependencies This closes #7894 Signed-off-by: David Handermann <exceptionfactory@apache.org> |
||
---|---|---|
.. | ||
nifi-py4j-bridge | ||
nifi-py4j-integration-tests | ||
nifi-py4j-nar | ||
nifi-python-extension-api | ||
nifi-python-framework | ||
nifi-python-framework-api | ||
nifi-python-test-extensions | ||
README.md | ||
pom.xml |
README.md
nifi-py4j-bundle module
The NiFi Py4J Bundle provides a linkage between NiFi's Java Process and Python. Py4J is the library used in order to launch an RPG server that can be used to communicate between the Java and Python Processes.
See the NiFi Python Developer's Guide for more information about how to build Processors in Python.
Debugging
There are times when it's helpful to enable remote debugging of the Python code. Because NiFi is responsible for launching the Python process, how to enable this may not be as straight-forward as when launching a Python process yourself. However, NiFi can be told to enable remote debugging when launching the Python process.
The manner in which you connect to the Python Process differs by IDE. Here, we will examine how to use VSCode's DebugPy.
Debugging Framework
The method for debugging the framework and debugging Processors is different. Typically, when performing debugging on the Framework itself, it is easiest to have NiFi enable a DebugPy listener when launching the Python process that hosts the Controller.
To enable remote debugging, NiFi will use pip
to install the debugpy
module into the environment used by the main Python process.
This process is used to discover available Processors and to create Processors. It is not used by Processors themselves.
Listen for Incoming Connections (Controller)
The following properties may be added to nifi.properties in order to enable remote debugging of the Controller process:
nifi.python.controller.debugpy.enable
: Indicates whether or not DebugPy should be used when launching hte Controller.
Defaults to false
. If set to true
, the Python process that is responsible for discovering and creating Processors
will be launched using DebugPy.
nifi.python.controller.debugpy.port
: The local port to use. Defaults to 5678
.
nifi.python.controller.debugpy.host
: The hostname to listen on. Defaults to localhost
.
nifi.python.controller.debugpy.logs.directory
: The directory to write DebugPy logs to. Defaults to ./logs
Note that these properties do not exist in the nifi.properties by default. This is intentional and is due to the fact that during any normal operations, this should not be used. This should be used only by developers wanting to debug the NiFi application itself.
Connecting to the Python Process
It is important, however, to note the host and port that the debugger is using. When establishing a connection to the remote debugger, the VSCode may be configured with both the local directory to use for Python source files, as well as the remote debugger.
Generally, the local directory should point to ${NIFI_SOURCE_DIRECTORY}/nifi-nar-bundles/nifi-py4j-bundle/nifi-python-framework/src/main/python/framework
.
The remote directory, which defaults to .
should be specified as ./python/framework
.
Debugging Processors
It is also important to enable remote debugging for Processors. We expect Processor developers to be able to do this, not just those who are maintaining the NiFi codebase. As a result, instructions for enabling remote debugging of Processors has been added to the NiFi Python Developer's Guide.