(->biosequenceIndex index path)
Positional factory function for class clj_biosequence.core.biosequenceIndex.
Positional factory function for class clj_biosequence.core.biosequenceIndex.
(->biosequenceIndexReader index strm)
Positional factory function for class clj_biosequence.core.biosequenceIndexReader.
Positional factory function for class clj_biosequence.core.biosequenceIndexReader.
(->bzipped file)
Positional factory function for class clj_biosequence.core.bzipped.
Positional factory function for class clj_biosequence.core.bzipped.
(->fastaFile file alphabet opts)
Positional factory function for class clj_biosequence.core.fastaFile.
Positional factory function for class clj_biosequence.core.fastaFile.
(->fastaReader strm alphabet)
Positional factory function for class clj_biosequence.core.fastaReader.
Positional factory function for class clj_biosequence.core.fastaReader.
(->fastaSequence acc description alphabet sequence)
Positional factory function for class clj_biosequence.core.fastaSequence.
Positional factory function for class clj_biosequence.core.fastaSequence.
(->fastaString str alphabet)
Positional factory function for class clj_biosequence.core.fastaString.
Positional factory function for class clj_biosequence.core.fastaString.
(->gzipped file)
Positional factory function for class clj_biosequence.core.gzipped.
Positional factory function for class clj_biosequence.core.gzipped.
(->uncompressed file)
Positional factory function for class clj_biosequence.core.uncompressed.
Positional factory function for class clj_biosequence.core.uncompressed.
(->zipped file)
Positional factory function for class clj_biosequence.core.zipped.
Positional factory function for class clj_biosequence.core.zipped.
(bioseq->string bs)
Returns the sequence of a biosequence as a string with newlines every 80 chars.
Returns the sequence of a biosequence as a string with newlines every 80 chars.
(alphabet this)
Returns the alphabet of a biosequence.
Returns the alphabet of a biosequence.
(bs-seq this)
Returns the sequence of a biosequence as a vector.
Returns the sequence of a biosequence as a vector.
(keywords this)
Returns a collection of keywords.
Returns a collection of keywords.
(moltype this)
Returns the moltype of a biosequence.
Returns the moltype of a biosequence.
(protein? this)
Returns true if a protein and false otherwise.
Returns true if a protein and false otherwise.
(biosequence->file bs
file
&
{:keys [append func] :or {append true func fasta-string}})
Takes a collection of biosequences and prints them to file. To
append to an existing file use :append true
and the :func
argument can be used to pass a function that will be used to prepare
the printed output, the default is fasta-string
which will print
the biosequences to the file in fasta format. Returns the file.
Takes a collection of biosequences and prints them to file. To append to an existing file use `:append true` and the `:func` argument can be used to pass a function that will be used to prepare the printed output, the default is `fasta-string` which will print the biosequences to the file in fasta format. Returns the file.
(pstart this)
Returns the start page of a citation object.
Returns the start page of a citation object.
(authors this)
Returns the authors from a citation object.
Returns the authors from a citation object.
(pend this)
Returns the end page of a citation object.
Returns the end page of a citation object.
(crossrefs this)
Returns crossrefs - DOI, ISBN etc
Returns crossrefs - DOI, ISBN etc
(abstract this)
Returns the abstract
Returns the abstract
(journal this)
Returns the journal of a citation object.
Returns the journal of a citation object.
(title this)
Returns the title of a citation object.
Returns the title of a citation object.
(volume this)
Returns the volume of a citation object.
Returns the volume of a citation object.
(year this)
Returns the year of a citation object.
Returns the year of a citation object.
(pubmed this)
Returns the pubmed id of a reference if there is one.
Returns the pubmed id of a reference if there is one.
(citation-key this)
Returns a citation key from a record.
Returns a citation key from a record.
(citations this)
Returns a collection of references in a sequence record.
Returns a collection of references in a sequence record.
(comments this)
Returns comments.
Returns comments.
(filter-comments this value)
Filters comments based on the return value of obj-type.
Filters comments based on the return value of obj-type.
(database-name this)
Returns the name of the database.
Returns the name of the database.
(db-properties this)
Returns properties of the reference.
Returns properties of the reference.
(object-id this)
Returns the ID of an database object.
Returns the ID of an database object.
(get-db-refs this)
Returns db ref records.
Returns db ref records.
(description this)
Returns the description of a biosequence object.
Returns the description of a biosequence object.
(evidence this)
Returns evidence records.
Returns evidence records.
(operator this)
Returns an operator for dealing with intervals.
Returns an operator for dealing with intervals.
(feature-seq this)
Returns a lazy list of features in a sequence.
Returns a lazy list of features in a sequence.
(filter-features this name)
Returns a list of features that return 'name' when called using bs/bs-name from biosequenceName protocol.
Returns a list of features that return 'name' when called using bs/bs-name from biosequenceName protocol.
(bs-path this)
Returns the path of the file as string.
Returns the path of the file as string.
(locus this)
Returns the gene locus.
Returns the gene locus.
(locus-tag this)
Returns a locus tag.
Returns a locus tag.
(map-location this)
Returns the map location.
Returns the map location.
(orf this)
ORF associated with the gene.
ORF associated with the gene.
(products this)
The products of a gene.
The products of a gene.
(genes this)
Returns sub-seq gene records.
Returns sub-seq gene records.
(go-component this)
The GO component, molecular function etc.
The GO component, molecular function etc.
(go-id this)
The GO id.
The GO id.
(go-term this)
The GO term.
The GO term.
(gos this)
Returns go term records.
Returns go term records.
(accession this)
Returns the accession of a biosequence object.
Returns the accession of a biosequence object.
(accessions this)
Returns a list of accessions for a biosequence object.
Returns a list of accessions for a biosequence object.
(creation-date this)
Returns a java date object.
Returns a java date object.
(dataset this)
Returns the dataset.
Returns the dataset.
(discontinue-date this)
Returns a date object.
Returns a date object.
(update-date this)
Returns a java date object.
Returns a java date object.
(version this)
Returns the version of the accession nil if none.
Returns the version of the accession nil if none.
(comp? this)
Is the interval complementary to the biosequence sequence. Boolean
Is the interval complementary to the biosequence sequence. Boolean
(end this)
Returns the end position of an interval as an integer.
Returns the end position of an interval as an integer.
(point this)
Returns a point interval.
Returns a point interval.
(start this)
Returns the start position of an interval as an integer.
Returns the start position of an interval as an integer.
(strand this)
Returns '+', '-', '.' or '?' for positive, minus, not stranded or unknwon respectively.
Returns '+', '-', '.' or '?' for positive, minus, not stranded or unknwon respectively.
(intervals this)
Returns a list of intervals.
Returns a list of intervals.
(bs-reader this)
Returns a reader for a file containing biosequences. Use with `with-open'
Returns a reader for a file containing biosequences. Use with `with-open'
(allergen-names this)
Returns the allergen names.
Returns the allergen names.
(alternate-names this)
Returns the alternate names.
Returns the alternate names.
(biotech-names this)
Returns the biotech names.
Returns the biotech names.
(cd-antigen-names this)
Returns the cd names.
Returns the cd names.
(innnames this)
Returns the innname.
Returns the innname.
(names this)
Returns the default names of a record.
Returns the default names of a record.
(submitted-names this)
Returns the submitted names.
Returns the submitted names.
(obj-description this)
(obj-heading this)
(obj-id this)
(obj-label this)
(obj-name this)
(obj-text this)
(obj-type this)
(obj-value this)
(parameters this)
Returns parameters from a reader.
Returns parameters from a reader.
(activities this)
Returns a lit of activities.
Returns a lit of activities.
(calc-mol-wt this)
The calculated molecular weight.
The calculated molecular weight.
(ecs this)
Returns list of E.C numbers.
Returns list of E.C numbers.
(processed this)
Processing of the protein.
Processing of the protein.
(proteins this)
Returns protein sub-seq records.
Returns protein sub-seq records.
(biosequence-seq this)
Returns a lazy sequence of biosequence objects.
Returns a lazy sequence of biosequence objects.
(get-biosequence this accession)
Returns the biosequence object with the corresponding accession.
Returns the biosequence object with the corresponding accession.
(subcell-location this)
(subcell-orient this)
(subcell-topol this)
(summary this)
Returns the summary of a sequence.
Returns the summary of a sequence.
(synonyms this)
Returns a list of synonyms.
Returns a list of synonyms.
(tax-refs src)
Returns taxonomy records.
Returns taxonomy records.
(common-name this)
Returns the common name.
Returns the common name.
(lineage this)
Returns a lineage string.
Returns a lineage string.
(mods this)
Returns a list of modifications to taxonomy.
Returns a list of modifications to taxonomy.
(tax-name this)
Returns the taxonomic name.
Returns the taxonomic name.
(codon-start this)
The start codon.
The start codon.
(frame this)
Returns the frame a sequence should be translated in.
Returns the frame a sequence should be translated in.
(trans-table this)
Returns the translation table code to be used.
Returns the translation table code to be used.
(translation this)
Returns the translation of a sequence.
Returns the translation of a sequence.
(anchor this)
Text to show as highlight
Text to show as highlight
(post-text this)
Text after anchor
Text after anchor
(pre-text this)
Text before anchor.
Text before anchor.
(url this)
Returns the url.
Returns the url.
(original this)
Returns the original.
Returns the original.
(variant this)
Returns the variant.
Returns the variant.
(clean-sequences alphabet coll)
Removes spaces and newlines and checks that all characters are legal characters for the supplied alphabet. Replaces non-valid characters with \X. Only performs cleaning if supplied alphabet is a checked alphabet.
Removes spaces and newlines and checks that all characters are legal characters for the supplied alphabet. Replaces non-valid characters with \X. Only performs cleaning if supplied alphabet is a checked alphabet.
Default implementation of Biosequence protocol.
Default implementation of Biosequence protocol.
Default implementation of biosequenceCitation protocol.
Default implementation of biosequenceCitation protocol.
Default implementation of biosequenceCitations protocol.
Default implementation of biosequenceCitations protocol.
Default implementation of biosequenceComments protocol.
Default implementation of biosequenceComments protocol.
Default implementation of biosequenceDbRef protocol.
Default implementation of biosequenceDbRef protocol.
Default implementation of biosequenceDbRefs protocol.
Default implementation of biosequenceDbRefs protocol.
Default implementation of biosequenceDescription protocol.
Default implementation of biosequenceDescription protocol.
Default implementation of biosequenceFeature protocol.
Default implementation of biosequenceFeature protocol.
Default implementation of biosequenceFeatures protocol.
Default implementation of biosequenceFeatures protocol.
Default implementation of biosequenceFile protocol.
Default implementation of biosequenceFile protocol.
Default implementation of biosequenceGene protocol.
Default implementation of biosequenceGene protocol.
Default implementation of biosequenceGenes protocol.
Default implementation of biosequenceGenes protocol.
Default implementation of biosequenceGoTerm protocol.
Default implementation of biosequenceGoTerm protocol.
Default implementation of biosequenceGoTerms protocol.
Default implementation of biosequenceGoTerms protocol.
Default implementation of biosequenceID protocol.
Default implementation of biosequenceID protocol.
Default implementation of biosequenceInterval protocol.
Default implementation of biosequenceInterval protocol.
Default implementation of biosequenceIntervals protocol.
Default implementation of biosequenceIntervals protocol.
Default implementation of biosequenceName protocol.
Default implementation of biosequenceName protocol.
Default implementation of biosequenceNameObject protocol.
Default implementation of biosequenceNameObject protocol.
Default implementation of biosequenceNotes protocol.
Default implementation of biosequenceNotes protocol.
Default implementation of biosequenceProtein protocol.
Default implementation of biosequenceProtein protocol.
Default implementation of biosequenceProteins protocol.
Default implementation of biosequenceProteins protocol.
Default implementation of biosequenceStatus protocol.
Default implementation of biosequenceStatus protocol.
Default implementation of biosequenceSubCellLoc protocol.
Default implementation of biosequenceSubCellLoc protocol.
Default implementation of biosequenceSubCellLocs protocol.
Default implementation of biosequenceSubCellLocs protocol.
Default implementation of biosequenceSummary protocol.
Default implementation of biosequenceSummary protocol.
Default implementation of biosequenceSynonyms protocol.
Default implementation of biosequenceSynonyms protocol.
Default implementation of biosequenceTaxonomy protocol.
Default implementation of biosequenceTaxonomy protocol.
Default implementation of biosequenceTaxonomies protocol.
Default implementation of biosequenceTaxonomies protocol.
Default implementation of biosequenceTranslation protocol.
Default implementation of biosequenceTranslation protocol.
Default implementation of biosequenceUrl protocol.
Default implementation of biosequenceUrl protocol.
Default implementation of biosequenceVariant protocol.
Default implementation of biosequenceVariant protocol.
(fasta-string bioseq)
(fasta-string bioseq newlines?)
Returns the biosequence as a fasta formatted string. If newlines? is true then the sequence is formatted into 80 character rows.
Returns the biosequence as a fasta formatted string. If newlines? is true then the sequence is formatted into 80 character rows.
(fasta-reduce this func fold)
Applies a function to sequence data streamed line-by-line and
reduces the results using the supplied fold
function. Uses the
core reducers library so the fold function needs to have an
'identity' value that is returned when the function is called with
no arguments.
Applies a function to sequence data streamed line-by-line and reduces the results using the supplied `fold` function. Uses the core reducers library so the fold function needs to have an 'identity' value that is returned when the function is called with no arguments.
(get-interval-sequence interval bs)
Returns a fasta sequence corresponding to the provided interval.
Returns a fasta sequence corresponding to the provided interval.
(get-list obj & keys)
Low level macro for retrieving data from xml elements.
Low level macro for retrieving data from xml elements.
(get-one obj & keys)
Low level macro for retrieving data from xml elements.
Low level macro for retrieving data from xml elements.
(get-text obj & keys)
Low level macro for retrieving data from xml elements.
Low level macro for retrieving data from xml elements.
(id-convert ids from to email)
Takes a list of accessions and returns a hash-map mapping them to accessions from another database. If nothing found returns an empty hash-map and only returns entries that had a match. Uses the Uniprot id mapping utility and a list of supported databases is supplied at http://www.uniprot.org/faq/28#id_mapping_examples. Some common mappings include:
DB Name Abbreviation Direction UniProtKB AC/ID ACC+ID from UniProtKB AC ACC to EMBL/GenBank/DDBJ EMBL_ID both EMBL/GenBank/DDBJ CDS EMBL both Entrez Gene (GeneID) P_ENTREZGENEID both GI number P_GI both RefSeq Protein P_REFSEQ_AC both RefSeq Nucleotide REFSEQ_NT_ID both WormBase WORMBASE_ID both
There is a 100,000 limit on accessions in a single query imposed by Uniprot.
Takes a list of accessions and returns a hash-map mapping them to accessions from another database. If nothing found returns an empty hash-map and only returns entries that had a match. Uses the Uniprot id mapping utility and a list of supported databases is supplied at http://www.uniprot.org/faq/28#id_mapping_examples. Some common mappings include: DB Name Abbreviation Direction UniProtKB AC/ID ACC+ID from UniProtKB AC ACC to EMBL/GenBank/DDBJ EMBL_ID both EMBL/GenBank/DDBJ CDS EMBL both Entrez Gene (GeneID) P_ENTREZGENEID both GI number P_GI both RefSeq Protein P_REFSEQ_AC both RefSeq Nucleotide REFSEQ_NT_ID both WormBase WORMBASE_ID both There is a 100,000 limit on accessions in a single query imposed by Uniprot.
(index-biosequence-collection collection
file
&
{:keys [func index]
:or {func accession index nil}})
(index-biosequence-file file
&
{:keys [func index] :or {func accession index nil}})
(index-combine-files files outpath)
Takes a collection of biosequence files and generates an index combining all the files. If files have parameters only the parameter object from the last file will be retained.
Takes a collection of biosequence files and generates an index combining all the files. If files have parameters only the parameter object from the last file will be retained.
(init-fasta-file path alphabet & opts)
Initialises fasta protein file. Accession numbers and description are processed by splitting the string on the first space, the accession being the first value and description the second. Encoding can be specified using the :encoding keyword, defaults to UTF-8.
Initialises fasta protein file. Accession numbers and description are processed by splitting the string on the first space, the accession being the first value and description the second. Encoding can be specified using the :encoding keyword, defaults to UTF-8.
(init-fasta-sequence accession description alphabet sequence)
(init-fasta-sequence accession description alphabet sequence check-seq)
Returns a new fastaSequence. Currently :iupacNucleicAcids and :iupacAminoAcids are supported alphabets.
Returns a new fastaSequence. Currently :iupacNucleicAcids and :iupacAminoAcids are supported alphabets.
(init-fasta-string str alphabet)
Initialises a fasta string. Accession numbers and description are processed by splitting the string on the first space, the accession being the first value and description the second.
Initialises a fasta string. Accession numbers and description are processed by splitting the string on the first space, the accession being the first value and description the second.
(interval-complete? interval)
Returns the interval if it has a start and end value or a point value, nil otherwise.
Returns the interval if it has a start and end value or a point value, nil otherwise.
(map->biosequenceIndex m__7585__auto__)
Factory function for class clj_biosequence.core.biosequenceIndex, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.biosequenceIndex, taking a map of keywords to field values.
(map->biosequenceIndexReader m__7585__auto__)
Factory function for class clj_biosequence.core.biosequenceIndexReader, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.biosequenceIndexReader, taking a map of keywords to field values.
(map->bzipped m__7585__auto__)
Factory function for class clj_biosequence.core.bzipped, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.bzipped, taking a map of keywords to field values.
(map->fastaFile m__7585__auto__)
Factory function for class clj_biosequence.core.fastaFile, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.fastaFile, taking a map of keywords to field values.
(map->fastaReader m__7585__auto__)
Factory function for class clj_biosequence.core.fastaReader, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.fastaReader, taking a map of keywords to field values.
(map->fastaSequence m__7585__auto__)
Factory function for class clj_biosequence.core.fastaSequence, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.fastaSequence, taking a map of keywords to field values.
(map->fastaString m__7585__auto__)
Factory function for class clj_biosequence.core.fastaString, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.fastaString, taking a map of keywords to field values.
(map->gzipped m__7585__auto__)
Factory function for class clj_biosequence.core.gzipped, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.gzipped, taking a map of keywords to field values.
(map->uncompressed m__7585__auto__)
Factory function for class clj_biosequence.core.uncompressed, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.uncompressed, taking a map of keywords to field values.
(map->zipped m__7585__auto__)
Factory function for class clj_biosequence.core.zipped, taking a map of keywords to field values.
Factory function for class clj_biosequence.core.zipped, taking a map of keywords to field values.
(n50 reader)
Takes anything that can have `biosequence-seq' called on it and returns the N50 of the sequences therein.
Takes anything that can have `biosequence-seq' called on it and returns the N50 of the sequences therein.
(normalise-frame frame)
Takes a frame in the 'minus' format and returns it as a 1-6 frame.
Takes a frame in the 'minus' format and returns it as a 1-6 frame.
(object->file obj file)
Spits an object to file after making sure print-length is temporarily set to false.
Spits an object to file after making sure *print-length* is temporarily set to false.
(protein-charge p & {:keys [ph disulfides] :or {ph 7 disulfides 0}})
Calculates the theoretical protein charge at the specified pH (default 7). Uses pKa values set out in the protein alphabets from cl-biosequence.alphabet. Considers Lys, His, Arg, Glu, Asp, Tyr and Cys residues only and ignores all other amino acids. The number of disulfides can be specified and 2 times this figure will be deducted from the number of Cys residues used in the calculation. Values used for the pKa of the N-term and C-term are 9.69 and 2.34 respectively.
Calculates the theoretical protein charge at the specified pH (default 7). Uses pKa values set out in the protein alphabets from cl-biosequence.alphabet. Considers Lys, His, Arg, Glu, Asp, Tyr and Cys residues only and ignores all other amino acids. The number of disulfides can be specified and 2 times this figure will be deducted from the number of Cys residues used in the calculation. Values used for the pKa of the N-term and C-term are 9.69 and 2.34 respectively.
(reverse-comp this)
Returns a new fastaSequence with the reverse complement sequence.
Returns a new fastaSequence with the reverse complement sequence.
(reverse-seq this)
Returns a new fastaSequence with the reverse sequence.
Returns a new fastaSequence with the reverse sequence.
(six-frame-translation nucleotide)
(six-frame-translation nucleotide table)
Returns a lazy list of fastaSequence objects representing translations of a nucleotide biosequence object in six frames.
Returns a lazy list of fastaSequence objects representing translations of a nucleotide biosequence object in six frames.
(sub-bioseq bs beg)
(sub-bioseq bs beg end)
Returns a new fasta sequence object with the sequence corresponding to 'beg' (inclusive) and 'end' (exclusive) of 'bs'. If no 'end' argument returns from 'start' to the end of the sequence. Unlike vectors assumes NOT a zero based index to correlate with common intervals in annotation files.
Returns a new fasta sequence object with the sequence corresponding to 'beg' (inclusive) and 'end' (exclusive) of 'bs'. If no 'end' argument returns from 'start' to the end of the sequence. Unlike vectors assumes NOT a zero based index to correlate with common intervals in annotation files.
(translate bs frame & {:keys [table] :or {table (ala/codon-tables 1)}})
Returns a fastaSequence sequence representing the translation of the specified biosequence in the specified frame.
Returns a fastaSequence sequence representing the translation of the specified biosequence in the specified frame.
cljdoc is a website building & hosting documentation for Clojure/Script libraries
× close