The RDKit's code for doing R-group decomposition (RGD) is quite flexible but also rather "undocumented". Thanks to that fact, you may not be aware of some of the cool stuff that's there. This post is an attempt to at least begin to remedy that.
We'll look at a number of difficult/interesting problems that arise all the time when doing RGD on real-world datasets:
Here's the nbviewer iframe:
We'll look at a number of difficult/interesting problems that arise all the time when doing RGD on real-world datasets:
- Handling symmetric cores
- Handling stereochemistry
- Handling sidechains that attach to the core at more than one point
- Handling multiple scaffolds or variable scaffolds
Here's the nbviewer iframe: