This is paper is recommended by my colleague Eunji. She states that this paper is well written and have a huge influence on the current molecular generation model

Since I personally don’t have a strong background on the molecular generation, I can will put my strength on introducing the motivation, background introduction and diffusion part.

Motivation & Background Introduction

Overall, the molecular generation is about generate a molecular to satisfy some function. For example, like insulin, or some enzyme to facilitate

In this paper, the motivation is to do some modification on the original large molecular so that it is more stable and easy to produce.

Noted that the modification is not by adding or cut some small chemical group on original group. The modification or “prediction” from my opinion is by generating the 3D geometry shape of the molecular by using the prior knowledge we have for the molecular. In other word, we can plot the chemistry expression on a 2D paper just like in the high school. However, what we don’t know is the angle between two group. This shape is really crucial to keep the function of the molecular and it is also crucial for the production. Since we want the most stable shape that can keep the function of molecular.

So far, we understand that the task for this paper is actually predicting the shape of the molecular so that it is stable and it has its original function.

However there is no explicit function that we can apply to find this molecular. Besides that, by iterating through all possible shape of the large molecular is intractable. Therefore the pipeline we have here is that first we use some computing method that generate a bunch of plausible shape. The stability can be approximately solve through so called Boltzmann method that based on statistical mechanics. We then select some molecular to run the simulation. According the simulation result, we then select some of the best candidates for the real chemical experiment.

For this paper we only focus on the first stage which is generating reasonable amount of plausible molecular.

Preliminaries