Lookup Analytic
This use case shows you how we can use our MANN and aisquared libraries to build and configure a .air file which can be dragged & dropped into a widget created using air JS for use in the browser.
In this example, we will show how to create a
.air
file to perform a simple lookup of analytic results from an analytic that has already been run. To do this, we will create a simple example which will utilize a regular expression to identify strings of digits which could potentially be phone numbers. Once those are identified, the phone numbers will be stripped of all punctuation (characters such as parentheses and hyphens) and looked up in a fictional customer database to see if there is any additional information about that customer.For this notebook, the following dependencies are required:
This package is available on on pypi via
pip
. The following cell also runs the commands to install this dependency as well as imports them into the notebook environment.! pip install aisquared
import aisquared
import json
Now that the required packages have been installed and imported, it is time to create the results of the analytic. These results are going to be recorded in JSON form via a Python dictionary, with the top-level key being the digits of the phone number and the lower-level keys being the associated individual's name and customer ID.
This dictionary will then be saved to a JSON file, and we will then configure the
.air
file configuration around it.# Configure the example results
results = {
'1111111111' : {
'name' : 'John Doe',
'customerID' : 11111
},
'2222222222' : {
'name' : 'Jane Doe',
'customerID' : 22222
},
'3333333333' : {
'name' : 'Alice Doe',
'customerID' : 33333
},
'4444444444' : {
'name' : 'Bob Doe',
'customerID' : 44444
},
'5555555555' : {
'name' : 'Charlie Doe',
'customerID' : 55555
},
'6666666666' : {
'name' : 'Denise Doe',
'customerID' : 66666
},
'7777777777' : {
'name' : 'Evan Doe',
'customerID' : 77777
},
'8888888888' : {
'name' : 'Frank Doe',
'customerID' : 88888
},
'9999999999' : {
'name' : 'Gregory Doe',
'customerID' : 99999
}
}
# Save the analytic as a JSON file
file_name = 'analytic.json'
with open(file_name, 'w') as f:
json.dump(results, f)
In the following block, we configure the harvesting, preprocessing, analytic, postprocessing, and rendering steps. Once those are created, we add them all to the
ModelConfiguration
object and compile them into the .air
file.# For harvesting, we need to harvest using a regular expression
# that identifies possible phone numbers. The following lines of
# code configure such harvesting
regex = '/^[\.-)( ]*([0-9]{3})[\.-)( ]*([0-9]{3})[\.-)( ]*([0-9]{4})$/'
harvester = aisquared.config.harvesting.TextHarvester(
how = 'regex',
regex = regex
)
# The only preprocessing step that has to be conducted is the
# removal of all punctuation, since the phone numbers in the
# analytic JSON are all strictly digits
step = aisquared.config.preprocessing.RemoveCharacters(
remove_digits = False,
remove_punctuation = True
)
preprocesser = aisquared.config.preprocessing.TextPreprocessor()
preprocesser.add_step(step)
# The analytic for this configuration is going to be a LocalAnalytic
# class, where we pass the saved file to the object
analytic = aisquared.config.analytic.LocalAnalytic(file_name)
# No postprocessing steps are needed, so we can set that value to None
postprocesser = None
# To render results, we are going to use the WordRendering class to
# initialize the rendering of badges for individual words. By default,
# the WordRendering class renders the specific words/tokens that were
# input into the analytics
renderer = aisquared.config.rendering.WordRendering()
Now that the pieces are in place, we can put them together into a .air file that is ready to use in the AI Squared Extension.
# Finally, we will take the previous objects and put them all
# together into a single ModelConfiguration object, which is then
# compiled into the .air file
config = aisquared.config.ModelConfiguration(
name = 'PhoneNumberLookup',
harvesting_steps = harvester,
preprocessing_steps = preprocesser,
analytic = analytic,
postprocessing_steps = postprocesser,
rendering_steps = renderer,
version = None,
description = 'Phone number lookup which shows name and customer ID (if present) for phone numbers found',
mlflow_uri = None,
mlflow_user = None,
mlflow_token = None
)
# compile to create .air file
config.compile()
Last modified 1yr ago