
k-Nearest Neighbour (kNN)

In pattern recognition, the k-Nearest Neighbors algorithm (or k-NN for short) is a non-parametric method used for classification and regression. [Source: Wikipedia]
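As a rough illustration of the classification case, here is a minimal sketch assuming Euclidean distance and a simple majority vote among the k closest training points. The function name `knn_classify` and the toy data are illustrative only and are not taken from any tool listed on this page.

```python
from collections import Counter
import math

def knn_classify(train, query, k=3):
    """Classify `query` by majority vote among its k nearest training points.

    `train` is a list of (feature_vector, label) pairs.
    """
    # Sort training points by Euclidean distance to the query (Python 3.8+).
    by_distance = sorted(train, key=lambda item: math.dist(item[0], query))
    # Count the labels among the k closest points and return the most common.
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]

# Toy data (assumed for illustration): two clusters labelled "a" and "b".
train = [
    ((0.0, 0.0), "a"),
    ((0.1, 0.2), "a"),
    ((1.0, 1.0), "b"),
    ((0.9, 1.1), "b"),
    ((1.2, 0.8), "b"),
]
print(knn_classify(train, (0.2, 0.1), k=3))
```

For regression, the same neighbour search would typically be followed by averaging the neighbours' numeric values instead of voting.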

HCS-Analyzer

Submitted by ChenLiang on Fri, 09/02/2016 - 21:59

High-throughput screening is a powerful technology, used principally by the pharmaceutical industry, that allows the identification of molecules of interest within large libraries. Screening was originally target based; cellular assays provide a way to test compounds (or other biological material such as small interfering RNA) in a more physiologically realistic in vitro environment. High-content screening (HCS) platforms are now available at lower cost, giving universities and research institutes the opportunity to use these technologies for research purposes.

Average rating: 5 (1 vote)

MaturePred

Submitted by ChenLiang on Fri, 09/02/2016 - 21:59

MicroRNAs (miRNAs) are a set of short (19-24 nt) non-coding RNAs that play significant roles as posttranscriptional regulators in animals and plants. The ab initio prediction methods show excellent performance for discovering new pre-miRNAs. While most of these methods can distinguish real pre-miRNAs from pseudo pre-miRNAs, few can predict the positions of miRNAs. Among the existing methods that can also predict miRNA positions, most are designed for mammalian miRNAs, including human and mouse. Only a minority of methods can predict the positions of plant miRNAs.

Average rating: 5 (1 vote)

HeteroMirPred

Submitted by ChenLiang on Fri, 09/02/2016 - 21:59

An ensemble classifier approach for microRNA precursor (pre-miRNA) classification was proposed, based on combining a set of heterogeneous algorithms, including support vector machine (SVM), k-nearest neighbors (kNN) and random forest (RF), and then aggregating their predictions through a voting system. In addition to the proposed algorithm, classification performance was also improved using discriminative features, self-containment and its derivatives, which have shown unique structural robustness characteristics of pre-miRNAs.
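The voting step can be sketched as follows. This is only an illustration under stated assumptions: the description above says "a voting system" without specifying the rule, so plain majority voting is assumed here, and the three classifiers are hypothetical threshold rules standing in for trained SVM, kNN and RF models.

```python
from collections import Counter

def majority_vote(classifiers, sample):
    """Return the label predicted by the largest number of classifiers."""
    predictions = [clf(sample) for clf in classifiers]
    return Counter(predictions).most_common(1)[0][0]

# Hypothetical stand-ins for trained SVM, kNN and RF models: each is just a
# threshold rule on toy features, not a real classifier.
def svm_like(x):
    return "pre-miRNA" if x[0] > 0.5 else "pseudo"

def knn_like(x):
    return "pre-miRNA" if x[1] > 0.5 else "pseudo"

def rf_like(x):
    return "pre-miRNA" if x[0] + x[1] > 1.2 else "pseudo"

ensemble = [svm_like, knn_like, rf_like]
print(majority_vote(ensemble, (0.9, 0.2)))
```

With an odd number of classifiers and two classes, a tie cannot occur, which is one reason three-member ensembles are a common choice for this kind of vote.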

Average rating: 5 (1 vote)

CHNmiRD

Submitted by ChenLiang on Fri, 09/02/2016 - 21:59

MicroRNAs (miRNAs) play an important role in the development and progression of human diseases. The identification of disease-associated miRNAs will be helpful for understanding the molecular mechanisms of diseases at the post-transcriptional level. Based on different types of genomic data sources, computational methods for miRNA-disease association prediction have been proposed.

Average rating: 5 (1 vote)

miRSM

Submitted by ChenLiang on Tue, 01/09/2018 - 19:22

MicroRNA (miRNA) sponges with multiple tandem miRNA binding sequences can sequester miRNAs from their endogenous target mRNAs. Therefore, miRNA sponges acting as decoys are extremely important for long-term loss-of-function studies both in vivo and in silico. Recently, a growing number of in silico methods have been used as an effective technique to generate hypotheses for in vivo methods for studying the biological functions and regulatory mechanisms of miRNA sponges.

Average rating: 3 (2 votes)

miRLocator

Submitted by ChenLiang on Fri, 09/02/2016 - 21:59

MicroRNAs (miRNAs) are a class of short, non-coding RNAs that play regulatory roles in a wide variety of biological processes, such as plant growth and abiotic stress responses. Although several computational tools have been developed to identify primary miRNAs and precursor miRNAs (pre-miRNAs), very few provide the functionality of locating mature miRNAs within plant pre-miRNAs.

Average rating: 5 (1 vote)

miRNAss

Submitted by ChenLiang on Tue, 01/09/2018 - 19:24

Although many machine learning techniques have been proposed for distinguishing miRNA hairpins from other stem-loop sequences, most of the current methods use supervised learning, which requires a very good set of positive and negative examples. Those methods have important practical limitations when they have to be applied to a real prediction task. First, there is the challenge of dealing with a small number of positive (well-known) pre-miRNA examples.

Average rating: 3.5 (2 votes)

miRTP

Submitted by ChenLiang on Fri, 09/02/2016 - 21:59

We used a machine learning method, the nearest neighbor algorithm (NNA), to learn the relationship between miRNAs and their target proteins, generating a predictor which can then judge whether a new miRNA-target pair is true or not. We acquired 198 positive (true) miRNA-target pairs from TarBase and the literature, and generated 4,888 negative (false) pairs through random combination. A 0/1 system and the frequencies of single nucleotides and di-nucleotides were used to encode miRNAs into vectors, while various physicochemical parameters were used to encode the targets.
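The frequency part of the miRNA encoding might look like the sketch below. It is an assumption-laden illustration: it covers only single-nucleotide and overlapping di-nucleotide frequencies, leaves out the "0/1 system" and the physicochemical encoding of targets (neither is specified above), and the function name `frequency_vector` and the toy sequence are hypothetical.

```python
from itertools import product

ALPHABET = "ACGU"
# All 16 di-nucleotides in a fixed order: AA, AC, AG, AU, CA, ...
DINUCLEOTIDES = ["".join(p) for p in product(ALPHABET, repeat=2)]

def frequency_vector(seq):
    """Encode an RNA sequence (length >= 2) as 4 + 16 frequencies."""
    # Accept DNA-style input by mapping T to U.
    seq = seq.upper().replace("T", "U")
    # Fraction of each single nucleotide.
    singles = [seq.count(n) / len(seq) for n in ALPHABET]
    # Overlapping di-nucleotides: positions (0,1), (1,2), ...
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    doubles = [pairs.count(d) / len(pairs) for d in DINUCLEOTIDES]
    return singles + doubles

# Toy sequence, not a real miRNA.
vec = frequency_vector("AACG")
print(len(vec), vec[:4])
```

Vectors of this kind can then be compared with a distance measure, which is what a nearest neighbor predictor needs as input.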

Average rating: 5 (1 vote)

GRNMF

Submitted by ChenLiang on Tue, 01/09/2018 - 17:03

MicroRNAs (miRNAs) play crucial roles in post-transcriptional regulation and various cellular processes. The identification of disease-related miRNAs provides great insights into the underlying pathogenesis of diseases at a system level. However, most existing computational approaches are biased towards known miRNA-disease associations, which makes them unsuitable for new diseases or miRNAs without any known association information.

Average rating: 5 (1 vote)

BioSeq-Analysis

Submitted by ChenLiang on Tue, 01/09/2018 - 17:37

With the avalanche of biological sequences generated in the post-genomic age, one of the most challenging problems is how to computationally analyze their structures and functions. Machine learning techniques are playing key roles in this field. Typically, predictors based on machine learning techniques involve three main steps: feature extraction, predictor construction and performance evaluation. Although several Web servers and stand-alone tools have been developed to facilitate biological sequence analysis, each focuses on only an individual step.

Average rating: 5 (1 vote)