Plumerai Video Intelligence JNI¶
This document describes the Java Native Interface (JNI) for the Plumerai Video Intelligence software for videos.
Compilation¶
When running with the JNI, make sure to point the Java library path to where the Plumerai video intelligence shared object can be found, e.g. java -Djava.library.path=path/to/plumerai/
. Furthermore, be sure to include both of the Plumerai Java source files in your project: PlumeraiBoxPrediction.java
and PlumeraiVideoIntelligence.java
.
Android¶
If you are using the JNI API from Android, simply place the libplumeraivideointelligence.so
file in the jniLibs
folder for your target, e.g. in app/src/main/jniLibs/arm64-v8a
, and proceed as normal.
If you encounter errors regarding a missing libc++_shared.so
file, then you have two options to solve it:
- The file
libc++_shared.so
is provided by the Android NDK, so you can simply copy that file from the NDK to the folder where you also putlibplumeraivideointelligence.so
and it will be automatically loaded. See also this suggestion. For example this might be done as follows:cp ~/Android/ndk/26.1.10909125/toolchains/llvm/prebuilt/linux-x86_64/sysroot/usr/lib/aarch64-linux-android/libc++_shared.so ~/AndroidTestApp/app/src/main/jniLibs/arm64-v8a/
(modify as needed for your system and project). - If you don't compile any C++ code yet, first add a dummy C++ source file to your Android project. Then, in your Gradle file, add
arguments '-DANDROID_STL=c++_shared'
andcppFlags ''
to the sectionandroid { defaultConfig { externalNativeBuild { cmake {
(you might have to create theexternalNativeBuild
andcmake
sections). That should trigger Gradle to addlibc++_shared.so
to your APK file. For more information about this solution, see this thread or this stackoverflow issue.
The API¶
The JNI API consists of two Java source files which are self-documented. It is simple enough: there is a constructor that needs to be run once, and a process_frame
method that must be called for every input frame. There is an example below.
PlumeraiVideoIntelligence¶
Initializes a new video intelligence object.
This needs to be called only once at the start of the application.
Arguments
- height
int
: The height of the input image in pixels. - width
int
: The width of the input image in pixels.
Returns
Nothing.
deletePlumeraiVideoIntelligence¶
Destructor, should be called manually at the end to clean-up.
Arguments
None.
Returns
Nothing.
processFrame¶
public int processFrame(ByteArray image_data, PlumeraiBoxPrediction[] results,
int max_results, float delta_t);
Process a single frame from a video sequence.
This version supports RGB888 input. Note that the algorithm comes with a built-in threshold (e.g. 0.6 - this differs per model): bounding boxes with confidences lower than that value won't be reported at all by this function.
Arguments
- image_data
ByteBuffer
: A ByteBuffer object with 3x8-bit RGB image data (1st byte red, 3rd blue) of sizeheight * width * 3
. The ByteBuffer needs to be allocated withallocateDirect
. - results
PlumeraiBoxPrediction[]
: The resulting bounding-boxes found in the frame, should be preallocated by the user to sizemax_results
. - max_results
int
: The maximum number of bounding-boxes that will be returned. - delta_t
float
: The time in seconds it took between this and the previous video frame (1/fps). If left to 0, then the system clock will be used to compute this value.
Returns
int
: an error code, which can be any of the following:
SUCCESS = 0
: Everything was OK.INTERNAL_ERROR = -1
: Should not occur, contact Plumerai if this happens.INVALID_DELTA_T = -2
: Thedelta_t
parameter should be >= 0.INVALID_BUFFER_TYPE = -100
: Theimage_data
buffer is not a direct ByteBuffer.INVALID_BUFFER_SIZE = -101
: Theimage_data
buffer is not the correct size.
BoxPrediction¶
public class PlumeraiBoxPrediction {
public float y_min; // top coordinate between 0 and 1 in height dimension
public float x_min; // left coordinate between 0 and 1 in width dimension
public float y_max; // bottom coordinate between 0 and 1 in height dimension
public float x_max; // right coordinate between 0 and 1 in width dimension
public float confidence; // between 0 and 1, higher means more confident
public int id; // the tracked identifier of this box
public int class_id; // the class of the detected object, see below
// Allowed values for the `class_id` variable above:
public static final int DETECTION_CLASS_UNKNOWN = 0;
public static final int DETECTION_CLASS_PERSON = 1;
public static final int DETECTION_CLASS_HEAD = 2;
public static final int DETECTION_CLASS_FACE = 3;
}
A structure representing a single resulting bounding box. Coordinates are between 0 and 1, the origin is at the upper left corner. Confidence values lie between 0 and 1. Note that the algorithm comes with a built-in threshold (e.g. 0.6 - this differs per model): bounding boxes with confidences lower than that value won't be produced at all by the Plumerai Video Intelligence functions.
Example usage¶
Below is an example of using the JNI shown above.
import com.plumerai.box_prediction.PlumeraiBoxPrediction;
import com.plumerai.video_intelligence.PlumeraiVideoIntelligence;
import java.nio.ByteBuffer;
public class PlumeraiVideoIntelligenceExample {
public static void main(String[] args) {
// Initialize the Plumerai Video Intelligence algorithm with 1600x1200 input
PlumeraiVideoIntelligence pvi = new PlumeraiVideoIntelligence(1200, 1600);
final int max_results = 10;
// Allocate storage for the input data, e.g. a camera frame
ByteBuffer image_data = ByteBuffer.allocateDirect(1600 * 1200 * 3);
// Loop over frames in a video stream
while (true) {
// Normally we would obtain the next frame here, e.g. from the camera
image_data = ...
// Prepare the results structure
PlumeraiBoxPrediction[] results = new PlumeraiBoxPrediction[max_results];
for (int i = 0; i < max_results; ++i) {
results[i] = new PlumeraiBoxPrediction();
results[i].id = -1; // to indicate invalid results, see below
}
// Run the Plumerai video intelligence algorithm
final float delta_t = 0.0f; // Should be set as the time between frames
final int error_code = pvi.processFrame(image_data, results,
max_results, delta_t);
if (error_code != PlumeraiVideoIntelligence.SUCCESS) {
System.err.printf("Plumerai video intelligence: error %d\n", error_code);
return;
}
// Print the results
for (int i = 0; i < max_results; ++i) {
PlumeraiBoxPrediction r = results[i];
if (results[i].id != -1) {
System.out.printf(
"Box #%d with confidence %.2f @ (x,y) -> (%.2f,%.2f)-(%.2f,%.2f)\n",
r.id, r.confidence, r.x_min, r.y_min, r.x_max, r.y_max);
}
}
}
// Clean-up
pvi.deletePlumeraiVideoIntelligence();
}
}