Speechocean, a well-known supplier of data resources and data services, is dedicated to providing high-quality databases and efficient services to its academic and industrial customers, helping them create value in the fields of Human Computer Interaction and Human Language Technology.
Speechocean provides many types of large databases in many languages and accents, along with data-related services such as data design, collection, transcription, annotation, validation, linguistic services, and other related processing. These serve technical fields such as speech synthesis, speech recognition, machine translation, web search, natural language understanding, and image recognition.
On request, we can also provide a One-Stop Data Solution tailored to a customer's specific requirements, covering data design, data collection, data processing, modeling and model training, testing, and evaluation. Please click "One-Stop Data Solution" for details.
Over the past 15 years, Speechocean has provided its customers with over 1000 databases and various customized services covering 110 languages and accents. To date, Speechocean has established ten overseas offices, staffed by experienced international teams and backed by a mature project management process, in Hong Kong, the UK, Germany, Spain, Canada, Russia, and other countries and regions.
Thanks to its large-scale data solution capability, combined with guaranteed high quality, cost optimization, and fast delivery, Speechocean has earned a strong reputation and the trust of its customers, and has established long-term cooperation with a diverse range of customers around the world.
Speechocean is also one of the world's largest language resource providers. By the end of 2014, KingLine Data Center (operated by Speechocean) held nearly 500 large-scale off-the-shelf corpora, covering 110+ languages/accents and 70+ regions around the world, that can be licensed to customers. All of these corpora come with full independent intellectual property rights and multiple layers of transcription and annotation, and can meet customers' diverse requirements for modeling and model training in the Human Computer Interaction field. (Please click "KingLine Data Center-Commercial Resources" for the data list.)
KingLine Data Center also has hundreds of high-quality academic resources to meet the experimental and testing needs of scientific research institutions, colleges, and individuals around the world. All of these corpora can be provided at minimal cost, far below their actual value. We also welcome members to share and exchange data with us. (Please click "KingLine Data Center-Academic Resources" for the data list.)
We have provided services to, and established long-term strategic partnerships with, world-class customers such as Microsoft, IBM, Google, Samsung, Canon, Nuance, Verint, Toshiba, Panasonic, Siemens, Baidu, Tencent, and many others, and we have earned their recognition and confidence.
Email : Contact@speechocean.com
Address : D-801, U-center Building, No.28 Chengfu Road, Haidian District, Beijing China
Post Code : 100083
Fax : +86-10-62660053 ext. 8103
Phone : +86-10-62660053
Sales : +86-10-62666126
Why Choose Us
As a global provider of data resources and data-related services, Speechocean offers the following advantages:
- Speechocean provides data-related services covering 110+ languages (accents), a number that keeps growing with customer demand, through offices and work-studios located in more than 10 countries. With the support of native speakers and linguists, we can carry out various types of data collection projects, including speech, text, and image (please click Data Collection Service for more information), and provide localized data processing services and technical support, including transcription, annotation, testing and evaluation, tools development, model training, and linguistic consultancy (please click Data Analysis and Processing Service for more information).
- KingLine Data Center, operated by Speechocean, is a data resource sharing platform with large-scale commercial data resources, including speech corpora, text corpora, image corpora, and pronunciation lexica, which can be licensed to customers for commercial purposes. It also holds many high-quality academic resources that can be used for research, model training, and testing. Speechocean owns independent intellectual property rights for both commercial and academic resources.
- Speechocean provides a One-stop Data Service Solution, which covers the provision of data collection facilities, balanced script design, project management for data collection and processing, multi-language linguistic services, model training, testing, and evaluation.
- Speechocean has an experienced project management team and a mature process control system. With nearly 20 years of experience on major global projects, we can ensure excellent project management and deliver low-risk, high-quality services to customers.
- Speechocean also provides tools development for data collection and processing, model training, testing and evaluation, and linguistic consulting, delivered by an experienced and strong R&D team. Please click Technical Support for more details.
Lin He - Chairman
Lin He is the founder of Speechocean and has worked in the field of speech science and technology since 1993. She has devoted herself to research and development in speech recognition, speech synthesis, and language understanding for Mandarin. She is highly experienced in speech data production and has led many large government projects. She has also been directly involved in projects covering 110+ languages. She is currently a Member of the Linguistic, Music and Auditory Sense Committee of the Acoustical Society of China and a Member of the Chinese Linguistic Data Consortium (CLDC).
Difei Tang - President & CEO
Mr. Tang Difei graduated from the Human-Machine Speech Communication Test Lab of the University of Science and Technology of China in 1996 with a master's degree in engineering. From 1996 to 2012, he worked at Lenovo, Microsoft Asia Academy of Engineering, and Aliyun Cloud Computing Ltd. He played a key role in major projects such as the research and development of a multilingual speech synthesis engine, the Microsoft Chinese phonetic input method, application development for the Windows Mobile phone platform, and Microsoft medical management system solutions, leading his teams to breakthroughs. Having held senior product development and business management positions at leading international enterprises, Mr. Tang has accumulated nearly 20 years of experience in business innovation, product development, and operations management. Since he joined Speechocean, the company's strategy management, business development strategy, and organizational efficiency have improved markedly.
Xianfeng Cheng (George) - Vice President & Sales Director
Mr. Cheng is Vice President of Speechocean, responsible for sales, marketing, and market development, and takes part in the company's strategic planning. He is also currently in charge of Speechocean's Legal Affairs Department. George joined Speechocean as a senior project manager in 2005 and managed numerous international and governmental projects in areas such as speech, web search, text analysis, and graphs and images, earning customers' trust and praise. From 2007 to 2012, George served as Business Manager, responsible for company strategy, market research, new business development, international project negotiation, and sales growth for both licensable materials and services. During his tenure, he has led the company through a period of extensive growth driven by data solutions and innovative services.
Ke Li - Vice President
Mr. Li graduated from the Department of Electronic Engineering, Tsinghua University, with a master's degree in 2006. He has been the Data Service Director of Speechocean since 2009, responsible for operations management of the company's data business in speech and language, web search, text analysis, and graphs and images. Mr. Li has extensive experience in technology research and development and in project management. From 2006 to 2009, he was responsible for R&D and management of numerous speech technologies at IBM China Development Lab. Since 2009, he has led a large project management and data annotation team in building a global data collection and processing network, establishing the company's data delivery centers in major regions across the world.
Yufeng HAO - Chief Scientist & R&D Director
Dr. Hao Yufeng received a doctoral degree in computational biology and artificial intelligence technology from Southeast University in 2004 and a doctoral degree in biological information science and engineering from the University of Angers, France, in 2005. Dr. Hao joined Speechocean in 2008 and is currently Chief Scientist and Director of the R&D Center, responsible for R&D strategic planning, technological innovation, and R&D management, and for strengthening the company's capacity in model training, linguistic consulting services, and solution development. Dr. Hao has over 15 years of experience in technology R&D, product R&D, and project management across algorithm development, model training, and system evaluation and testing. From 2008 to 2012, he built the company's networks of linguists and technology experts, which enabled breakthroughs in multilingual corpus design, speech signal processing module development, and technology evaluation and testing services, and he obtained more than 20 national software copyright registrations. Under his leadership, the Utrans data processing platform also won the technology innovation award of the Ministry of Science and Technology of China.
Siyao Lv - Financial Director & Secretary of the Board
Ms. Lv graduated from Guanghua School of Management, Peking University, with a Bachelor's degree in Accounting in 2005. In 2014, she obtained a Master's degree in Finance from the School of Finance, Renmin University of China. Her department is responsible for financial management, capital market operations, and official information disclosure. From 2005 to 2016, Ms. Lv worked at PricewaterhouseCoopers, My Decker Capital, and Intel China, and she holds the CICPA, CIA, and FCIB certifications. She has extensive experience in financial management, compliance management, investment and financing, tax planning, and other related areas.