User talk:Chenguiuc

Page contents not supported in other languages.
From Wikipedia, the free encyclopedia

Pseudo Amino Acid Composition[edit]

Introduction

Pseudo amino acid composition, or PseAA composition, was originally introduced by Professor Kuo-Chen Chou [1] in 2001 to represent protein samples for statistical prediction. In contrast with the conventional amino acid (AA) composition that contains 20 components with each reflecting the occurrence frequency for one of the 20 native amino acids in a protein, the PseAA composition contains a set of greater than 20 discrete factors, where the first 20 represent the components of its conventional AA composition while the additional factors incorporate some sequence-order information via various modes. Typically, these additional factors are a series of rank-different correlation factors along a protein chain, but they can also be any combinations of other factors so long as they can reflect some sorts of sequence-order effects one way or the other. Therefore, the essence of PseAA composition is that on one hand it covers the AA composition, but on the other hand it contains the information beyond the AA composition and hence can better reflect the feature of a protein sequence through a discrete model. Ever since the concept of PseAA composition was introduced, it has been widely utilized to predict various protein attributes, such as protein subcellular localization, membrane protein type, enzyme functional class, GPCR type, protease type, protein structural class, and protein secondary structural content, among many others (see the references cited in [2] and [3]). Meanwhile, various different modes to formulate the PseAA composition have also been developed.

Background

In the history of developing methods for predicting subcellular localization of proteins and their other attributes, two kinds of models were generally used to represent protein samples: (1) the sequential model, and (2) the non-sequential model or discrete model. The most typical sequential representation for a protein sample is its entire amino acid sequence, which can contain its most complete information. This is an obvious advantage of the sequential model. To get the desired results, the sequence-similarity-search-based tools are usually utilized to conduct the prediction. However, this kind of approach failed to work when a query protein did not have significant homology to attribute-known proteins. Thus, various discrete models were proposed.

The simplest discrete model is using the AA (amino acid) composition to represent protein samples, as formulated as follows. Given a protein sequence with amino acid resides, i.e.,

File:Formula1.jpeg(1)

where represents the 1st residue of the protein, the 2nd residue, and so forth, according to the AA composition model, the protein of Eq.1 can be expressed by (2) where are the normalized occurrence frequencies of the 20 native amino acids in , and the transposing operator. Owing to its simplicity, the AA composition model was widely used in many earlier statistical prediction methods for protein attributes. However, all the sequence-order information would be lost by using the AA composition to represent a protein. This is its main shortcoming. To avoid completely losing the sequence-order information, the concept of PseAA (pseudo amino acid) composition was proposed by Professor Kuo-Chen Chou [1]. According to the PseAA composition model, the protein of Eq.1 can be formulated as (3) where the components are given by (4) where is the weight factor, and the tier correlation factor that reflects the sequence order correlation between all the most contiguous residues (Fig.1) as formulated by (5) with (6) where is the function of the amino acid , and the total number of the functions considered. For example, in the original paper by Professor Kuo-Chen Chou[1] , , and are respectively the hydrophobicity value, hydrophilicity value, and side chain mass of amino acid ; , , and the corresponding values for the amino acid ; the total number of functions considered is . It can be seen from Eq.3 that the first 20 components, i.e.,, are associated with the conventional AA composition of protein , while the remaining components are the correlation factors that reflect the 1st tier, 2nd tier, …, and the th tier sequence order correlation patterns (Fig.1). It is through these additional factors that some important sequence-order effects are incorporated.

Web Server

Note that in Eq.3 is a parameter of integer and that choosing a different integer for will lead to a dimension-different PseAA composition [2]. Also note that using Eq.6 is just one of the modes for deriving the correlation factors or PseAA components. The others, such as the physicochemical distance mode [4] and amphiphilic pattern mode [5], can also be used to derive different types of PseAA composition. In 2008 a free server called PseAAC [6] is provided at the website http://chou.med.harvard.edu/bioinf/PseAAC/. By using the web server, users can generate the PseAA composition for any given protein sequence by selecting the mode as desired.





Figure 1. A schematic drawing to show (a) the 1st-tier, (b) the 2nd-tier, and (3) the 3rd-tier sequence-order-correlation mode along a protein sequence, where represent the amino acid residue at the sequence position 1, at position 2, and so forth, and the coupling factors are given by Eq.6. Panel (a) reflects the correlation mode between all the most contiguous residues, panel (b) that between all the 2nd most contiguous residues, and panel (c) that between all the 3rd most contiguous residues. Adapted from [1] with permission.



References

[1] K. C. Chou, Prediction of protein cellular attributes using pseudo amino acid composition, PROTEINS: Structure, Function, and Genetics (Erratum: ibid., 2001, Vol.44, 60) 43 (2001) 246-255.
[2] K. C. Chou, H. B. Shen, Review: Recent progresses in protein subcellular location prediction, Analytical Biochemistry 370 (2007) 1-16.
[3] K. C. Chou, H. B. Shen, Cell-PLoc: A package of web-servers for predicting subcellular localization of proteins in various organisms, Nature Protocols 3 (2008) 153-162.
[4] K. C. Chou, Prediction of protein subcellular locations by incorporating quasi-sequence-order effect, Biochemical & Biophysical Research Communications 278 (2000) 477-483.
[5] K. C. Chou, Using amphiphilic pseudo amino acid composition to predict enzyme subfamily classes, Bioinformatics 21 (2005) 10-19.
[6] H. B. Shen, K. C. Chou, PseAAC: a flexible web-server for generating various kinds of protein pseudo amino acid composition, Analytical Biochemistry 373 (2008) 386-388.

Welcome :)[edit]

Welcome!

Hello, Chenguiuc, and welcome to Wikipedia! Thank you for your contributions. I hope you like the place and decide to stay. Here are some pages that you might find helpful:

I hope you enjoy editing here and being a Wikipedian! Please sign your messages on discussion pages using four tildes (~~~~); this will automatically insert your username and the date. If you need help, check out Wikipedia:Questions, ask me on my talk page, or ask your question and then place {{helpme}} before the question on your talk page. Again, welcome! Midorihana~いいですね? はい! 03:36, 23 March 2008 (UTC)[reply]


Image copyright problem with Image:Formula1.JPG[edit]

Image Copyright problem
Image Copyright problem

Thank you for uploading Image:Formula1.JPG. However, it currently is missing information on its copyright status. Wikipedia takes copyright very seriously. It may be deleted soon, unless we can determine the license and the source of the image. If you know this information, then you can add a copyright tag to the image description page.

If you have any questions, please feel free to ask them at the media copyright questions page. Thanks again for your cooperation. NOTE: once you correct this, please remove the tag from the image's page. STBotI (talk) 03:45, 23 March 2008 (UTC)[reply]


Proposed deletion of Low-frequency Collective Motion in Biomacromolecules and Its Biological Functions[edit]

The article Low-frequency Collective Motion in Biomacromolecules and Its Biological Functions has been proposed for deletion because of the following concern:

Original research

While all contributions to Wikipedia are appreciated, content or articles may be deleted for any of several reasons.

You may prevent the proposed deletion by removing the {{dated prod}} notice, but please explain why in your edit summary or on the article's talk page.

Please consider improving the article to address the issues raised. Removing {{dated prod}} will stop the Proposed Deletion process, but other deletion processes exist. The Speedy Deletion process can result in deletion without discussion, and Articles for Deletion allows discussion to reach consensus for deletion. eaolson (talk) 21:13, 29 August 2009 (UTC)[reply]

It seems you have created 3 almost identical articles. You should consider using WP:REDIRECT. Clubmarx (talk) 01:16, 30 August 2009 (UTC)[reply]

File permission problem with File:Pseaa figure1.jpeg[edit]

Thanks for uploading File:Pseaa figure1.jpeg. I noticed that while you provided a valid copyright licensing tag, there is no proof that the creator of the file agreed to license it under the given license.

If you created this media entirely yourself but have previously published it elsewhere (especially online), please either

  • make a note permitting reuse under the CC-BY-SA or another acceptable free license (see this list) at the site of the original publication; or
  • Send an email from an address associated with the original publication to permissions-en@wikimedia.org, stating your ownership of the material and your intention to publish it under a free license. You can find a sample permission letter here. If you take this step, add {{OTRS pending}} to the file description page to prevent premature deletion.

If you did not create it entirely yourself, please ask the person who created the file to take one of the two steps listed above, or if the owner of the file has already given their permission to you via email, please forward that email to permissions-en@wikimedia.org.

If you believe the media meets the criteria at Wikipedia:Non-free content, use a tag such as {{non-free fair use in|article name}} or one of the other tags listed at Wikipedia:File copyright tags#Fair use, and add a rationale justifying the file's use on the article or articles where it is included. See Wikipedia:File copyright tags for the full list of copyright tags that you can use.

If you have uploaded other files, consider checking that you have provided evidence that their copyright owners have agreed to license their works under the tags you supplied, too. You can find a list of files you have created in your upload log. Files lacking evidence of permission may be deleted one week after they have been tagged, as described on criteria for speedy deletion. If you have any questions please ask them at the Media copyright questions page. Thank you. Diannaa (Talk) 19:23, 24 March 2011 (UTC)[reply]

Notice

The file File:Pseaa formula1.jpeg has been proposed for deletion because of the following concern:

unused, can be easily rendered using wiki markup

While all constructive contributions to Wikipedia are appreciated, pages may be deleted for any of several reasons.

You may prevent the proposed deletion by removing the {{proposed deletion/dated files}} notice, but please explain why in your edit summary or on the file's talk page.

Please consider addressing the issues raised. Removing {{proposed deletion/dated files}} will stop the proposed deletion process, but other deletion processes exist. In particular, the speedy deletion process can result in deletion without discussion, and files for discussion allows discussion to reach consensus for deletion. Ixfd64 (talk) 20:28, 17 December 2021 (UTC)[reply]