speech processing
Sponsored Links
Sponsored Links
Secleted [ 0 ] software to compare
Results 1 - 15 of about 2838
SoundTouch Sound Processing Library 1.3.0
SoundTouch Sound Processing Library is an open-source audio processing library. more>>
SoundTouch Sound Processing Library is an open-source audio processing library for changing the Tempo, Pitch and Playback Rates of audio streams or files:
- Tempo (time-stretch): Changes the sound to play at faster or slower speed than original, without affecting the sound pitch.
- Pitch (key) : Changes the sound pitch or key, without affecting the sound tempo or speed.
- Playback Rate : Changes both the sound tempo and pitch, as if an LP disc was played at wrong RPM rate.
Main features:
- Easy-to-use implementation of time-stretch, pitch-shift and sample rate transposing routines.
- High-performance object-oriented C++ implementation.
- Full source codes available for both the SoundTouch library and the example application.
- Clear and easy-to-use programming interface via a single C++ class.
- Supported audio data format : 16Bit integer or 32bit floating point PCM mono/stereo
- Capable of real-time audio stream processing:
- input/output latency max. ~ 100 ms.
- Processing 44.1kHz/16bit stereo sound in realtime requires a 133 Mhz Intel Pentium processor or better.
- Platform-independent implementation: The SoundTouch library can be compiled for any processor and OS platform supporting GNU C compiler (gcc) or Visual Studio, for example Win32, Linux, AIX.
- Additional assembler-level and Intel-MMX instruction set optimizations for Intel x86 compatible processors (Win32 & Linux platforms), offering several times increase in the processing performance.
- Compiled executable binaries available for Windows.
Enhancements:
- Add features/limitations/changes here
<<less- Tempo (time-stretch): Changes the sound to play at faster or slower speed than original, without affecting the sound pitch.
- Pitch (key) : Changes the sound pitch or key, without affecting the sound tempo or speed.
- Playback Rate : Changes both the sound tempo and pitch, as if an LP disc was played at wrong RPM rate.
Main features:
- Easy-to-use implementation of time-stretch, pitch-shift and sample rate transposing routines.
- High-performance object-oriented C++ implementation.
- Full source codes available for both the SoundTouch library and the example application.
- Clear and easy-to-use programming interface via a single C++ class.
- Supported audio data format : 16Bit integer or 32bit floating point PCM mono/stereo
- Capable of real-time audio stream processing:
- input/output latency max. ~ 100 ms.
- Processing 44.1kHz/16bit stereo sound in realtime requires a 133 Mhz Intel Pentium processor or better.
- Platform-independent implementation: The SoundTouch library can be compiled for any processor and OS platform supporting GNU C compiler (gcc) or Visual Studio, for example Win32, Linux, AIX.
- Additional assembler-level and Intel-MMX instruction set optimizations for Intel x86 compatible processors (Win32 & Linux platforms), offering several times increase in the processing performance.
- Compiled executable binaries available for Windows.
Enhancements:
- Add features/limitations/changes here
Download (0.42MB)
Added: 2006-02-15 License: GPL (GNU General Public License) Price:
1354 downloads
Workorder Processing System 0.3.0
Workorder Processing System allows you to quickly set up a work order/ticket tracking system for most service related businesses more>>
Workorder Processing System allows you to quickly set up a work order/ticket tracking system for most service related businesses.
Workorder Processing System is capable of handling multiple branches, which allows all branches to run from one central server connected to the Internet.
Main features:
- Multi branch capable with central server
- Customer address database
- Customer work history
- Searchable ticket database
- Business appointment scheduler
<<lessWorkorder Processing System is capable of handling multiple branches, which allows all branches to run from one central server connected to the Internet.
Main features:
- Multi branch capable with central server
- Customer address database
- Customer work history
- Searchable ticket database
- Business appointment scheduler
Download (0.21MB)
Added: 2007-04-16 License: GPL (GNU General Public License) Price:
924 downloads
Scilab Image Processing Toolbox 0.4.0
Scilab Image Processing Toolbox is an imaging, computer vision, and shape analysis package for SciLab. more>> <<less
Download (3.9MB)
Added: 2006-04-15 License: GPL (GNU General Public License) Price:
1321 downloads
bibEC Process Card 1.3.12
bibEC Process Card is a multiple gateway credit card payment processor. more>>
bibEC Process Card is a multiple gateway credit card payment processor.
This class is meant to process credit card payments via one of multiple payment gateways that are supported.
Currently the class supports the following payment gateways: Plug and Pay, Authorize.net, ViaKlix, and paynet . Changing between payment gateways is mostly a matter of changing the class constructor parameter.
The class provides a payment gateway independent API with functions for:
- Logging the payment activity
- Set the payment gateway authentication credentials
- Set the paying customer details
- Set the ship to details
- Set the credit card details
- Set the valuta
- Specify the order details
- Submit the payment processing request and retrieving the results
The payment submission is done securely when possible using PHP SSL socket connections or the Curl extension for PHP or the Curl command line.
<<lessThis class is meant to process credit card payments via one of multiple payment gateways that are supported.
Currently the class supports the following payment gateways: Plug and Pay, Authorize.net, ViaKlix, and paynet . Changing between payment gateways is mostly a matter of changing the class constructor parameter.
The class provides a payment gateway independent API with functions for:
- Logging the payment activity
- Set the payment gateway authentication credentials
- Set the paying customer details
- Set the ship to details
- Set the credit card details
- Set the valuta
- Specify the order details
- Submit the payment processing request and retrieving the results
The payment submission is done securely when possible using PHP SSL socket connections or the Curl extension for PHP or the Curl command line.
Download (MB)
Added: 2007-06-15 License: Freeware Price:
861 downloads
Process Raw Images 0.2
Process Raw Images is a service menu for processing many raw images at once. Supports output to Jpeg, Png and Tiff. more>>
Process Raw Images is a service menu for processing many raw images at once. Supports output to Jpeg, Png and Tiff.
<<less Download (0.006MB)
Added: 2007-02-05 License: GPL (GNU General Public License) Price:
996 downloads
speechd-el 2.0
speechd-el is an Emacs client to Speech Dispatcher, BrlTTY, and other alternative output devices. more>>
speechd-el is an Emacs client to Speech Dispatcher, BrlTTY, and other alternative output devices.
speechd-el provides a complex speech/Braille interface to Emacs, and is focused especially on (but not limited to) blind and visually impaired users.
It allows the user to work with Emacs without looking on the screen, using the speech and Braille output.
Enhancements:
- The output mechanism was modularized and redesigned to support multiple output devices.
- The most important impact of this change is BrlTTY support, which works in a way similar to how speech output works.
- This means that speechd-el can be used with both Speech Dispatcher and BrlTTY now.
<<lessspeechd-el provides a complex speech/Braille interface to Emacs, and is focused especially on (but not limited to) blind and visually impaired users.
It allows the user to work with Emacs without looking on the screen, using the speech and Braille output.
Enhancements:
- The output mechanism was modularized and redesigned to support multiple output devices.
- The most important impact of this change is BrlTTY support, which works in a way similar to how speech output works.
- This means that speechd-el can be used with both Speech Dispatcher and BrlTTY now.
Download (0.14MB)
Added: 2006-07-07 License: GPL (GNU General Public License) Price:
1204 downloads
Speech Recognition HOWTO 2.0
Speech Recognition HOWTO is a project that contains instructions for using speech recognition software and hardware. more>>
Speech Recognition HOWTO is a project that contains instructions for using speech recognition software and hardware and their use on Linux systems.
Enhancements:
- Changed license information (now GFDL) and added a new publication.
<<lessEnhancements:
- Changed license information (now GFDL) and added a new publication.
Download (MB)
Added: 2006-10-09 License: (FDL) GNU Free Documentation License Price:
1124 downloads
Julius Speech Recognition Engine 3.5.3
Julius is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder software. more>>
Julius is a high-performance, two-pass large vocabulary continuous speech recognition (LVCSR) decoder software for speech-related researchers and developers. Based on word 3-gram and context-dependent HMM, it can perform almost real-time decoding on most current PCs in 20k word dictation task.
Major search techniques are fully incorporated such as tree lexicon, N-gram factoring, cross-word context dependency handling, enveloped beam search, Gaussian pruning, Gaussian selection, etc. Besides search efficiency, it is also modularized carefully to be independent from model structures, and various HMM types are supported such as shared-state triphones and tied-mixture models, with any number of mixtures, states, or phones.
Standard formats are adopted to cope with other free modeling toolkit. The main platform is Linux and other Unix workstations, and also works on Windows. Julius is distributed with open license together with source codes, and has been used by many researchers and developers in Japan.
Main features:
- An open-source software (see terms and conditions of license (Japanese and English translation)).
- Real-time hi-speed and accurate recognition based on 2-pass strategy.
- Low memory requirement: less than 32MBytes required for work area (<<less
Major search techniques are fully incorporated such as tree lexicon, N-gram factoring, cross-word context dependency handling, enveloped beam search, Gaussian pruning, Gaussian selection, etc. Besides search efficiency, it is also modularized carefully to be independent from model structures, and various HMM types are supported such as shared-state triphones and tied-mixture models, with any number of mixtures, states, or phones.
Standard formats are adopted to cope with other free modeling toolkit. The main platform is Linux and other Unix workstations, and also works on Windows. Julius is distributed with open license together with source codes, and has been used by many researchers and developers in Japan.
Main features:
- An open-source software (see terms and conditions of license (Japanese and English translation)).
- Real-time hi-speed and accurate recognition based on 2-pass strategy.
- Low memory requirement: less than 32MBytes required for work area (<<less
Download (1.0MB)
Added: 2007-01-04 License: BSD License Price:
1054 downloads
Process Viewer 0.5.0
Process Viewer is a small utility similar to top which displays all the processes on a linux system. more>>
Process Viewer is a small utility similar to top which displays all the processes on a linux system. Its written using the FOX Toolkit.
Process Viewer is licensed under the GNU General Public License
<<lessProcess Viewer is licensed under the GNU General Public License
Download (0.027MB)
Added: 2005-10-13 License: GPL (GNU General Public License) Price:
1512 downloads
Java Parallel Processing Framework 1.0 Beta 1
Java Parallel Processing Framework is a set of tools and APIs to facilitate the parallelization of CPU intensive applications. more>>
Java Parallel Processing Framework is a set of tools and APIs to facilitate the parallelization of CPU intensive applications, and distribute their execution over a network of heterogenous nodes.
Java Parallel Processing Framework is intended to run in clusters and grids.
Main features:
- an API to delegate the processing of parallelized tasks to local and remote execution services
- a set of APIs and user interface tools to administrate and monitor execution services
- real-time adaptive load balancing capabilities
- scalability up to an arbitrary number of processing nodes
- support for failover and recovery
- limited intrusiveness for existing or legacy code
- a dynamic deployment mechanism, that enables the execution of new, or updated, code without having to deploy onto the grid
- fully documented APIs, administration guide and developer guide
- runs on any platform supporting Java 2 Platform Standard Edition 5.0 (J2SE 1.5)
Enhancements:
- JPPF is now licensed under the terms of the Apache License, Version 2.0. Starting from JPPF 1.0 beta1, all releases will be licensed under these terms.
- Prior versions will remain under the terms of the LGPL.
- For the first time, JPPF integrates seamlessly with J2EE. The JPPF Resource Adapter provides J2EE application servers with an access to native grid services.
- Numerous bug fixes, especially with regards to the distributed class loading mechanism.
<<lessJava Parallel Processing Framework is intended to run in clusters and grids.
Main features:
- an API to delegate the processing of parallelized tasks to local and remote execution services
- a set of APIs and user interface tools to administrate and monitor execution services
- real-time adaptive load balancing capabilities
- scalability up to an arbitrary number of processing nodes
- support for failover and recovery
- limited intrusiveness for existing or legacy code
- a dynamic deployment mechanism, that enables the execution of new, or updated, code without having to deploy onto the grid
- fully documented APIs, administration guide and developer guide
- runs on any platform supporting Java 2 Platform Standard Edition 5.0 (J2SE 1.5)
Enhancements:
- JPPF is now licensed under the terms of the Apache License, Version 2.0. Starting from JPPF 1.0 beta1, all releases will be licensed under these terms.
- Prior versions will remain under the terms of the LGPL.
- For the first time, JPPF integrates seamlessly with J2EE. The JPPF Resource Adapter provides J2EE application servers with an access to native grid services.
- Numerous bug fixes, especially with regards to the distributed class loading mechanism.
Download (7.8MB)
Added: 2007-07-31 License: The Apache License 2.0 Price:
816 downloads
Speech::Recognizer::ViaVoice 0.02
Speech::Recognizer::ViaVoice contains Perl wrappers for IBMs ViaVoice speech Recognizer library. more>>
Speech::Recognizer::ViaVoice contains Perl wrappers for IBMs ViaVoice speech Recognizer library.
SYNOPSIS
use Speech::Recognizer::ViaVoice;
$| = 1;
if (0 == connectEngine) {
if ( 0 == defineVocab(myTestVocab, [hello,world,quit]) ) {
print please say "hello", "world", or "quit" at each prompt..., "n";
while (0 == startListening) {
print "speak> ";
if (0 == recognize) {
my ($s, $score) = (getWord, getScore);
if (defined($s)) {
printf "%s, score=%dn", $s, $score;
if ($s eq quit) {
exit 0;
}
} else {
printf "not recognized!n";
}
}
stopListening;
}
}
disconnectEngine;
}
This module provides very basic use of IBMs ViaVoice library (http://www-4.ibm.com/software/speech/enterprise/te_3.html) in perl. It allows its user to pass a list of speech commands for recognition in a perl list, connect to the engine, and request recognized spoken words. The most notable feature missing from the current version is the ability to select different ViaVoice users. I plan to add this functionality shortly, but for now you must use the default user.
Consult the ViaVoice documentation for ViaVoice setup. Most will probably just need to run vvstartuserguru, which requires a java runtime environment. Results will be better if you take the time to do a little training to your own voice in this utility.
You can use this perl module for reconition of words from IBMs vocab dictionary for your chosen locale without further preparation. To recognize words not available in the dictionary or to recognize phrases, you need to create a file with phonetic spellings for use by the provided script pronunciations.pl (installed in /usr/local/bin by default). This utility reads a text file whose name is supplied as its sole argument. The file should contain lines with two fields separated by a tab character:
Deborah --- TAB -->D EH B OW R AX
Deborah --- TAB -->D EH B AXR R AX
Axl Rose -- TAB -->AE K S AX L R OW Z
The text on the left is the word or phrase to be recognized, and the text on the right is a phonetic spelling of the format specified in the ViaVoice docs (section 3 of /usr/doc/ViaVoice/bpreadme.txt in my installation). As in the example, a word or phrase might have more than one common pronunciation. You can provide all of them, one on each line.
pronunciations.pl will most likely require super user privileges to write its output to files in the ViaVoice lib directory:
/usr/lib/ViaVoice/vocabs/langs/En_US/pools/
<<lessSYNOPSIS
use Speech::Recognizer::ViaVoice;
$| = 1;
if (0 == connectEngine) {
if ( 0 == defineVocab(myTestVocab, [hello,world,quit]) ) {
print please say "hello", "world", or "quit" at each prompt..., "n";
while (0 == startListening) {
print "speak> ";
if (0 == recognize) {
my ($s, $score) = (getWord, getScore);
if (defined($s)) {
printf "%s, score=%dn", $s, $score;
if ($s eq quit) {
exit 0;
}
} else {
printf "not recognized!n";
}
}
stopListening;
}
}
disconnectEngine;
}
This module provides very basic use of IBMs ViaVoice library (http://www-4.ibm.com/software/speech/enterprise/te_3.html) in perl. It allows its user to pass a list of speech commands for recognition in a perl list, connect to the engine, and request recognized spoken words. The most notable feature missing from the current version is the ability to select different ViaVoice users. I plan to add this functionality shortly, but for now you must use the default user.
Consult the ViaVoice documentation for ViaVoice setup. Most will probably just need to run vvstartuserguru, which requires a java runtime environment. Results will be better if you take the time to do a little training to your own voice in this utility.
You can use this perl module for reconition of words from IBMs vocab dictionary for your chosen locale without further preparation. To recognize words not available in the dictionary or to recognize phrases, you need to create a file with phonetic spellings for use by the provided script pronunciations.pl (installed in /usr/local/bin by default). This utility reads a text file whose name is supplied as its sole argument. The file should contain lines with two fields separated by a tab character:
Deborah --- TAB -->D EH B OW R AX
Deborah --- TAB -->D EH B AXR R AX
Axl Rose -- TAB -->AE K S AX L R OW Z
The text on the left is the word or phrase to be recognized, and the text on the right is a phonetic spelling of the format specified in the ViaVoice docs (section 3 of /usr/doc/ViaVoice/bpreadme.txt in my installation). As in the example, a word or phrase might have more than one common pronunciation. You can provide all of them, one on each line.
pronunciations.pl will most likely require super user privileges to write its output to files in the ViaVoice lib directory:
/usr/lib/ViaVoice/vocabs/langs/En_US/pools/
Download (0.008MB)
Added: 2007-06-13 License: Perl Artistic License Price:
865 downloads
Speech Dispatcher 0.6.2
Speech Dispatcher is a device independent layer for speech synthesis. more>> <<less
Download (0.81MB)
Added: 2007-07-06 License: GPL (GNU General Public License) Price:
848 downloads
Speegle Define 1.1
Speegle Define is a Firefox extension that gives a spoken definition of a word using Speegle Speech Technology. more>>
Speegle Define is a Firefox extension that gives a spoken definition of a word using Speegle Speech Technology. You highlight the word you would like explained with highlight left click on any internet page you are reading.
Right click and choose "Audio Definition" from the pop up menu and its definition will be read back to you in English through your speakers or headphones.
<<lessRight click and choose "Audio Definition" from the pop up menu and its definition will be read back to you in English through your speakers or headphones.
Download (0.004MB)
Added: 2007-06-02 License: MPL (Mozilla Public License) Price:
874 downloads
gnome-speech 0.4.16
GNOME Speechs purpose is to provide a simple general API for producing text-to-speech output. more>>
GNOME Speechs purpose is to provide a simple general API for producing text-to-speech output.
The GNOME Speech 1.0 API is currently under development, and it will provide API for both text-to-speech output as well as speech input. It will be heavily influenced by the Java Speech API, both the existing 1.0 specification, and the new 2.0 one which is currently under development.
GNOME Speech 0.4 is made up of three basic parts:
- IDL interface definitions
- libgnomespeech - Convenience library for speech driver development
- Java package to aid in developing GNOME Speech drivers in Java
- Sample driver implementations
<<lessThe GNOME Speech 1.0 API is currently under development, and it will provide API for both text-to-speech output as well as speech input. It will be heavily influenced by the Java Speech API, both the existing 1.0 specification, and the new 2.0 one which is currently under development.
GNOME Speech 0.4 is made up of three basic parts:
- IDL interface definitions
- libgnomespeech - Convenience library for speech driver development
- Java package to aid in developing GNOME Speech drivers in Java
- Sample driver implementations
Download (0.46MB)
Added: 2007-07-30 License: GPL (GNU General Public License) Price:
821 downloads
SpeakGoodChinese 0.5
Speak Good Chinese is a cross-platform application that allows you or your students to train their Mandarin pronunciation. more>>
Speak Good Chinese is a cross-platform application based on GTK technology that allows you or your students to train their Mandarin pronunciation. Our software is based on Praat, leading software in speech analysis. Our speech technology is backed by the Institute of Phonetic Sciences part of the University of Amsterdam.
SpeakGoodChinese software was supported financially by the DU. Questions can be mailed to stefandekonink@xs4all.nl, they will be forwarded to the appropriate project member.
Enhancements:
- This release makes administrative tasks easier for teachers or students that want to manage or distribute their own wordlists.
- Another new feature in this subsystem is the shuffle function.
- Several bugs were solved related to cleaning up old recordings.
- New documentation was added.
<<lessSpeakGoodChinese software was supported financially by the DU. Questions can be mailed to stefandekonink@xs4all.nl, they will be forwarded to the appropriate project member.
Enhancements:
- This release makes administrative tasks easier for teachers or students that want to manage or distribute their own wordlists.
- Another new feature in this subsystem is the shuffle function.
- Several bugs were solved related to cleaning up old recordings.
- New documentation was added.
Download (9.0MB)
Added: 2007-06-24 License: GPL (GNU General Public License) Price:
858 downloads
Secleted [ 0 ] software to compare
Copyright Notice:
Software piracy is theft, Using crack, password, serial numbers, registration codes, key generators is illegal and prevent future software development. The above speech processing search only lists software in full, demo and trial versions for free download. Download links are directly from our mirror sites or publisher sites, torrent files or links from rapidshare.com, yousendit.com or megaupload.com are not allowed