Isolating black box objective functions#
Note
To run this example, you need to have conda
installed.
The black-boxes inside poli
may have tricky dependencies that are incompatible the one with the other (e.g. clashes in biopython
versions). As a way to mitigate this, we provide conda
environments in which each black box can reliably run.
For example:
the
foldx
stability objective lives insidepoli__protein
, which hasbiopython
,pdb-tools
and other libraries that are required to interface well with proteins and their representations.the
dockstring
objective lives insidepoli__dockstring
, which includes babel as part of its installation.
poli
provides a way of running these objective functions in isolation.
Running objective functions in isolation#
Let’s say that your current environment doesn’t have the package dockstring
installed. You can still run the objective function:
# No output: we don't have it installed.
!pip freeze | grep dockstring
from poli.objective_repository import DockstringBlackBox
f = DockstringBlackBox(target_name="DRD2")
poli 🧪: Starting the function dockstring as an isolated process.
As you can see, poli
warns us that it is creating the conda environment associated with the black box function, and it is creating an isolated process where the objective can be run.
We can still evaluate f
like normal:
import numpy as np
# The example from dockstring's documentation: risperidone.
x0 = np.array(["CC1=C(C(=O)N2CCCCC2=N1)CCN3CCC(CC3)C4=NOC5=C4C=CC(=C5)F"])
f(x0)
/Users/sjt972/anaconda3/envs/poli__dockstring/lib/python3.9/site-packages/dockstring/utils.py:77: DockstringWarning: Although Mac use is supported, docking scores on Mac do not always perfectly match scores from Linux. Therefore, extra care should be taken when comparing results to other platforms. In particular, the baselines in the DOCKSTRING paper were computed on Linux, so please do not directly compare your docking scores to the scores reported on the paper.
warnings.warn(
array([[11.9]])
(Optional) What is happening under the hood?#
Each black box/problem implements all its complicated logic inside an AbstractIsolatedFunction
, which lives in a different submodule inside the problem’s folder.
When we create a black box instance (like f = DockstringBlackBox(target_name="DRD2")
here), one of two things happen:
If we are able to import this isolated function, we do it directly and store it in
f.inner_function
,If we are not able to import it (say, because we don’t have the right dependencies in our current environment), poli starts an isolated process running with a pre-defined conda environment, and still stores it in
f.inner_function
. Calls to this isolated process are handled usingmultiprocess
.
Read more#
You can check the individual dependencies of each black box inside their implementation, or in their documentation.