Skip to content

integrativemodeling/mediator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

77 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DOI

The 21-subunit Mediator complex transduces regulatory information from enhancers to promoters, and performs an essential role in the initiation of transcription in all eukaryotes. In this study we performed chemical cross-linking and mass spectrometry, and combined the results with information from X-ray crystallography, homology modeling, and cryo-electron microscopy by an integrative modeling approach to determine a 3-D model of the entire Mediator complex. The approach is validated by the use of X-ray crystal structures as internal controls and by consistency with previous results from electron microscopy and yeast two-hybrid screens. The model shows the locations and orientations of all Mediator subunits, as well as subunit interfaces and some secondary structural elements. Segments of 20-40 amino acid residues are placed with an average precision of 20 Å. The model reveals roles of individual subunits in the organization of the complex.

Compatibility:

The scripts were run for the publication with IMP.pmi version from 915c00bac191ab32f022ae425facf538b64e3e54 to 32a774583007c3a7135c8adf970846ea7a52f453 and IMP version 3d7d35a367342a94655b5ca2f75cc4d4fb2d5c71. They should also work with any recent version of IMP.

Running the modeling script:

For more details on how to install IMP, run the modeling scripts and analyze the results using IMP and IMP.pmi see the IMP tutorial.

In brief, to use Replica Exchange, IMP must be compiled using an openmpi c++ compiler. We suggest to compile openmpi with the --disable-dlopen flag. See the IMP building instructions.

To run the modeling script, access the sampling/modeling directory and then run with 64 threads (64 replicas):

mpirun -np 64 python modeling.py

Content of the directories:

analysis: analysis scripts and clustering results of the whole ensemble of solutions, the two ensemble halves, and the jackknifing

sampling: sampling and modeling scripts, all input data

analysis directory:

clustering: the clustering results for the whole ensemble of solutions

clustering_half1: the clustering results for the ensemble of solutions considering the first half of all models produced

clustering_half2: the clustering results for the ensemble of solutions considering the second half of all models produced

clustering_jackknifing: the clustering results for the ensemble of solutions obtained jackknifing the cross-link dataset

present in all analysis/clustering* directories

clustering.py: clustering script

kmeans_weight_500_1: the set of 500 best scoring models collected in a single cluster

kmeans_weight_500_4: the set of 500 best scoring models divided into 4 clusters

content of kmeans_weight_500_4 directory

cluster.0: data for cluster 1

cluster.1: data for cluster 2

cluster.2: data for cluster 3

cluster.3: data for cluster 4

precision.0.0.out,precision.0.1.out,precision.0.2.out,precision.0.3.out ... precision.3.3.out: files containing the precision of a cluster (i.e., the files with the same indexes, precision.i.i.out, e.g., precision.0.0.out) and the files containing the distance between the clusters (i.e., files with different indexes, precision.i.j.out).

content of cluster.* directories

0.pdb,1.pdb,2.pdb....: the pdb files of the solutions

0.rmf3,1.rmf3,2.rmf3,...: the rmf files of the solution (can be opened with UCSF Chimera)

rmsf.med10.dat,rmsf.med11.dat,...: text file of the RMSF analysis

rmsf.med10.pdf,rmsf.med11.pdf,...: pdf file of the RMSF analysis

show_localization.py: chimera session file to display the localization densities

stat.out: stat file containing all relevant information on the score, etc.

view1_matrix: used by Chimera session file.

view2_matrix: used by Chimera session file.

XL_table_middle.pdf: pdf file of the cross-link map for the middle module

XL_table_tail.pdf: pdf file of the cross-link map for the tail module

present only in analysis/clustering

graph_plotting.py: compute the graph of the interactions between the subunits

high_confidence_structure.py: get the cluster-center solution and map the RMSF of beads with a color. Need Chimera to display it.

precision_rmsf.py: calculate the precision of clusters, their mutual distance and the RMSF

XL_table_middle.py: calculate the contact map and the cross-link map of the middle module

XL_table_tail.py: calculate the contact map and the cross-link map of the tail module

sampling directory:

CXMS_files: cross-link datasets

em_map_files: input electron microscopy maps, with their GMM representation

fasta_files: fasta files with primary sequences of Mediator subunits

model_gmm_files: GMM representation of Mediator domains

modeling: modeling script

pdb_files: crystallographic structures and homology models

sampling/CXMS_files directory:

full_med_splitmods.txt: mediator cross-link dataset, cvs file.

jackknife_analysis: contains jackknifed datasets

sampling/em_map_files directory:

asturias_mediator.mrc: the input file from Asturias lab EMD-2634

asturias_mediator_translated.mrc: the map translated in the coordinate center.

asturias_middle_module_translated.mrc: the middle module density

asturias_middle_module_translated_resampled.mrc: resampled middle module.

asturias_middle_module_translated_resampled.mrc.gmm.29.mrc: mrc file with the GMM approximation of the middle module density, with 29 gaussians.

asturias_middle_module_translated_resampled.mrc.gmm.29.txt: text file with 29 gaussian coordinates and covariance tensors for the middle module.

asturias_tail_module_translated.mrc: tail module density.

asturias_tail_module_translated_resampled.mrc: resampled tail module.

asturias_tail_module_translated_resampled.mrc.gmm.49.mrc: mrc file with the GMM approximation of the tail module density, with 49 gaussians.

asturias_tail_module_translated_resampled.mrc.gmm.49.txt: text file with 49 gaussian coordinates and covariance tensors for the tail module.

calculate.density.sh: script used to calculate the GMMs.

sampling/pdb_files directory:

cr_mid_fullmed10.pdb: middle module pdb

head_module_em_aligned_translated.pdb: head module pdb

med16.NTD.phyre.model.pdb: homology model of med16

Information

Author(s): Riccardo Pellarin, Charles Greenberg

Date: September 24th, 2015

License: CC-BY-SA-4.0. This work is freely available under the terms of the Creative Commons Attribution-ShareAlike 4.0 International License.

Last known good IMP version: build info build info

Publications: