Protocols for HIS data acquisition, data synchronization and import mechanisms
Data collection may be the most critical step toward building an AI model. Several choices must be made in advance and kept constant throughout the data collection process. The core choices are:
Wavelength Range Choice
The first choice is the wavelength range and full width at half maximum (FWHM) that suit the specific use case. The literature can provide guidance on both. These requirements, in turn, determine the sensor to be used.
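As a minimal sketch of this selection step, the snippet below filters candidate sensors by the required wavelength range and the coarsest acceptable FWHM. The sensor names, their specifications, and the example requirement (450-950 nm, FWHM of at most 6 nm) are hypothetical placeholders; real values would come from the literature and from sensor datasheets.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SensorSpec:
    """Hypothetical sensor description; all values are illustrative."""
    name: str
    min_nm: float   # shortest wavelength covered, in nm
    max_nm: float   # longest wavelength covered, in nm
    fwhm_nm: float  # spectral resolution (FWHM), in nm


def suitable_sensors(sensors, need_min_nm, need_max_nm, max_fwhm_nm):
    """Return sensors that cover the needed range with FWHM at or below the limit."""
    return [
        s for s in sensors
        if s.min_nm <= need_min_nm
        and s.max_nm >= need_max_nm
        and s.fwhm_nm <= max_fwhm_nm
    ]


# Placeholder candidates (assumed values, not real datasheets).
CANDIDATES = [
    SensorSpec("sensor_a", 400.0, 1000.0, 5.5),
    SensorSpec("sensor_b", 900.0, 1700.0, 8.0),
    SensorSpec("sensor_c", 400.0, 1000.0, 12.0),
]

# Assumed use case: 450-950 nm with FWHM no coarser than 6 nm.
selected = suitable_sensors(CANDIDATES, 450.0, 950.0, 6.0)
print([s.name for s in selected])
```

Only sensors that satisfy both the range and the FWHM requirement remain, which mirrors how the two choices jointly narrow down the sensor.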
Distance
Next, the distance between the sensor and the sample should be kept constant throughout the measurements.
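One way to verify this during acquisition is to log the sensor-to-sample distance for each measurement and flag any drift. The sketch below assumes distances are recorded in millimetres and uses a 2 mm tolerance; both the unit and the tolerance are illustrative assumptions, not values prescribed by the protocol.

```python
def distance_is_constant(distances_mm, tolerance_mm=2.0):
    """Return True if every logged distance stays within the tolerance
    of the first logged distance (tolerance is an assumed value)."""
    if not distances_mm:
        raise ValueError("no distances logged")
    reference = distances_mm[0]
    return all(abs(d - reference) <= tolerance_mm for d in distances_mm)


# Hypothetical logs from two acquisition sessions.
stable_session = [300.0, 300.5, 299.2, 301.0]
drifting_session = [300.0, 300.5, 306.0]

print(distance_is_constant(stable_session))    # within 2 mm of 300.0
print(distance_is_constant(drifting_session))  # 306.0 deviates by 6 mm
```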
Model Robustness and Accuracy
To strengthen the model’s robustness and accuracy, the samples used should have a reasonable variation in the characteristic under evaluation.
Sample Number
The number of samples should be at least 200.
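The two requirements above (sufficient variation and at least 200 samples) can be checked before modelling begins. The following is a minimal sketch: the 200-sample minimum comes from the protocol, while the use of the coefficient of variation as the measure of variation and its 0.10 threshold are assumptions made only for illustration.

```python
import statistics

MIN_SAMPLES = 200  # minimum sample count stated in the protocol


def check_sample_set(reference_values, min_relative_spread=0.10):
    """Report sample count and variation of the measured characteristic.

    min_relative_spread is an assumed threshold on the coefficient of
    variation (standard deviation / |mean|); it is not part of the protocol.
    """
    n = len(reference_values)
    spread = 0.0
    if n > 1:
        mean = statistics.fmean(reference_values)
        if mean != 0:
            spread = statistics.stdev(reference_values) / abs(mean)
    return {
        "n_samples": n,
        "enough_samples": n >= MIN_SAMPLES,
        "relative_spread": spread,
        "enough_variation": spread >= min_relative_spread,
    }


# Hypothetical reference values for the characteristic under evaluation.
varied_set = [10.0 + 0.05 * i for i in range(200)]
uniform_set = [5.0] * 50

report = check_sample_set(varied_set)
small_report = check_sample_set(uniform_set)
print(report["enough_samples"], report["enough_variation"])
print(small_report["enough_samples"], small_report["enough_variation"])
```

What counts as "reasonable" variation depends on the characteristic being measured, so the threshold should be adapted to the use case.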
Lighting System
If the measurements take place in a lab, the next choice is the lighting system. Lamp performance varies with the chosen wavelength range. Once again, the literature can provide guidance, or preliminary experiments can be conducted to determine the most suitable lamp. Outdoor measurement sessions should preferably be conducted when the sun is at its peak and the sky is cloudless.