# Project Introduction

<figure><img src="/files/Ulx4kbPZBhHaqKgomqIu" alt=""><figcaption><p>Operon Structure</p></figcaption></figure>

In bacteria, a set of adjacent genes on the same strand of DNA can be regulated by the same promoter. They are transcribed into a single mRNA molecule. Translation of the single mRNA molecule then yields the individual proteins. This is operon

Genes in the same operon are close to each other. Genes in the same operon have similar expression patterns. Genes in different operons are further away from each other. Genes in different operons tend to have diverse expression patterns.

In Bacillus subtilis, 10% operons are known from experiments. We can use different learning methods to predict the operon structure for the remaining 90% of the genes. we need to choose some predictor variables that can be measured easily and are somehow related to the operon structure. Here we choose two predictor variables: x1: the number of base pairs between the gene pair x2: the similarity score in expression profile.

Here is our data:

<figure><img src="/files/x0QBq3asM27Q6gAnOm5G" alt=""><figcaption></figcaption></figure>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://igb.mit.edu/mini-courses/python/machine-learning-with-python/hands-on/project-introduction.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
