Basis Technology

BasisTech is a software company specializing in applying artificial intelligence techniques to understanding documents and unstructured data written in different languages. It has headquarters in Somerville, Massachusetts with a subsidiary office in Tokyo. Its legal name is BasisTech LLC.

BasisTech
Company type: Private
Industry: Information technology, information access, digital forensics, transliteration
Founded: 1995
Headquarters: Somerville, Massachusetts, United States
Area served: Americas, Europe, Asia
Key people: Carl Hoffman (CEO, Co-Founder); Steven Cohen (EVP/COO, Co-Founder); Brian Carrier (CTO and GM Cyber Forensics); Simson Garfinkel (Chief Scientist); Junichi Hasegawa (VP Asia)
Products: KonaSearch, Cyber Triage, Autopsy, Sleuth Kit
Subsidiaries: BasisTech GK
Websites: http://www.basistech.com, http://www.konasearch.com, http://www.autopsy.com, http://www.cybertriage.com

The company was founded in 1995 by graduates of the Massachusetts Institute of Technology to apply artificial intelligence techniques for natural language processing, helping computer systems understand written human language. Its software analyzes free-form text so that applications can better understand the meaning of the words. For example, the software can identify tokens, parts of speech, and lemmas.[1] The tools can also identify different forms of names and phrases. A person's name, say Albert P. Jones, can appear in many different ways: some texts will call him "Al Jones", others "Mr. Jones", and others "Albert Paul Jons".[2]
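The name-variation problem described above can be illustrated with a minimal sketch. It is not BasisTech's method: the nickname table, the title list, the 0.6/0.4 weights, and the function names are all assumptions made for illustration, and the string comparison uses Python's standard `difflib` rather than any commercial matcher.

```python
import re
from difflib import SequenceMatcher

# Hypothetical lookup tables; a real system would use far larger resources.
NICKNAMES = {"al": "albert"}
TITLES = {"mr", "mrs", "ms", "dr"}


def normalize(name):
    """Lowercase, drop punctuation and titles, and expand known nicknames."""
    tokens = re.findall(r"[a-z]+", name.lower())
    tokens = [t for t in tokens if t not in TITLES]
    return [NICKNAMES.get(t, t) for t in tokens]


def name_similarity(a, b):
    """Return a score in [0, 1]; higher means the names are more alike."""
    ta, tb = normalize(a), normalize(b)
    if not ta or not tb:
        return 0.0
    # Treat the last token as the surname and compare it character by character,
    # so that a misspelling such as "Jons" still scores close to "Jones".
    surname = SequenceMatcher(None, ta[-1], tb[-1]).ratio()
    if len(ta) > 1 and len(tb) > 1:
        # Both names carry a given name: blend it in (weights are arbitrary).
        given = SequenceMatcher(None, ta[0], tb[0]).ratio()
        return 0.6 * surname + 0.4 * given
    # Only a surname is available on one side, e.g. "Mr. Jones".
    return surname


if __name__ == "__main__":
    reference = "Albert P. Jones"
    for variant in ["Al Jones", "Mr. Jones", "Albert Paul Jons", "Maria Lopez"]:
        print(variant, round(name_similarity(reference, variant), 2))
```

A caller would typically compare the score against a threshold chosen for the application; the sketch leaves that choice open.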

The software also performs entity extraction, that is, finding words in text that refer to people, places, and organizations, for uses such as due diligence, intelligence, and metadata tagging.[3]

The company is best known for its Rosette product, which uses natural language processing techniques to improve information retrieval, text mining, search engines, and other applications. The tool enables search engines to search in multiple languages[4] and to match identities and dates.[5] Rosette was sold to Babel Street in 2022.[6]

BasisTech software is also used by forensic analysts to search through files for words, tokens, phrases, or numbers that may be important to investigators.[7] The company also provides Cyber Triage, software that helps organizations respond to cyberattacks.[8]

Rosette

Rosette comes as a cloud (public or on-premise) deployment or Java SDK.[9] Rosette provides a variety of natural language processing tools for unstructured text: language identification, base linguistics, entity extraction, name matching, name translation, sentiment analysis, semantic similarity, relationship extraction, topic extraction, categorization, and Arabic chat translation.[10] It can be integrated into applications to enhance financial compliance onboarding,[11] communication surveillance compliance,[12] social media monitoring,[13] cyber threat intelligence,[14] and customer feedback analysis.[15]

The Rosette Linguistics Platform is composed of these modules:

  • Rosette Language Identifier looks at the structural and statistical signature of the file to identify the language. The pre-configured software can recognize 55 different languages with 45 different encodings.
  • Rosette Base Linguistics identifies the lemma or word stem after finding the tokens. Search is often faster and more accurate when words are grouped by their stem.[16]
  • Rosette Entity Extractor analyzes raw text and identifies the probable role that words and phrases play in the document, a key step that lets algorithms distinguish between the various meanings that many words can have. Splitting the raw text into groups of words according to their role and then classifying their contribution to meaning is often called entity analysis. The Basis hybrid approach mixes statistical modeling with rules, regular expressions, and gazetteers (lists of special words that can be tuned to the language and text being analyzed). The tool is designed to work directly with varied alphabets and multiple languages, an advantage because foreign words are often transliterated in multiple ways.[17] It is believed to be the first commercially available tool for analyzing Arabic text.[18]
  • Rosette Name Translator transliterates non-Latin alphabets like Arabic into a consistent Latin form.
  • Rosette Name Indexer enables simple search across name variations either by plugging into open source search engines or as a standalone service.[19]
  • Rosette Core Library for Unicode smooths the use of Unicode text.
  • Rosette Chat Translator for Arabic converts words from the Arabic chat alphabet to Arabic.
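
The rule-based half of the hybrid entity extraction approach described in the list above (gazetteers plus regular expressions) can be sketched as follows. This is an illustrative assumption, not Rosette's implementation or API: the gazetteer entries, the labels, the date pattern, and the function name are hypothetical, and the statistical modeling component is omitted entirely.

```python
import re

# Hypothetical gazetteer: lists of known names per entity type.
GAZETTEER = {
    "PERSON": ["Albert Jones"],
    "LOCATION": ["Somerville", "Tokyo"],
    "ORGANIZATION": ["BasisTech"],
}

# Hypothetical rule: ISO-style dates such as 2019-06-01.
DATE_PATTERN = re.compile(r"\b\d{4}-\d{2}-\d{2}\b")


def extract_entities(text):
    """Return (mention, label) pairs in order of appearance in the text."""
    found = []
    # Gazetteer lookup: exact, whole-word matches of each listed name.
    for label, names in GAZETTEER.items():
        for name in names:
            pattern = r"\b" + re.escape(name) + r"\b"
            for match in re.finditer(pattern, text):
                found.append((match.start(), match.group(), label))
    # Regular-expression rule: anything shaped like a date.
    for match in DATE_PATTERN.finditer(text):
        found.append((match.start(), match.group(), "DATE"))
    # Sort by position so the output follows reading order.
    return [(mention, label) for _, mention, label in sorted(found)]


if __name__ == "__main__":
    sample = "Albert Jones joined BasisTech in Somerville on 2019-06-01."
    for mention, label in extract_entities(sample):
        print(label, mention)
```

In a hybrid system, a statistical model would propose mentions that no list or rule covers, and the gazetteer would be tuned per language and domain.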

Rosette is used both by United States government offices to support translation and by major Internet infrastructure firms such as search engines.[20][21]

Digital forensics

BasisTech develops open-source digital forensics tools, The Sleuth Kit and Autopsy, to help identify and extract clues from data storage devices like hard disks or flash cards, as well as devices such as smart phones and iPods. The open-source licensing model allows them to be used as the foundation for larger projects like a Hadoop-based tool for massively parallel forensic analysis of very large data collections.

The digital forensics tool set is used to analyze file systems, new media types, new file types, and file system metadata. The tools can search for particular patterns in the files, allowing them to target significant files or usage profiles. They can, for instance, look for common files using hash functions and also deconstruct the data structures of important operating system log files.
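
Looking for common files with hash functions can be sketched in a few lines. The example below is a generic illustration under stated assumptions, not code from The Sleuth Kit or Autopsy: the function names, the choice of SHA-256, and the idea of passing in a set of known hashes are all assumptions.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Hash a file in chunks so large files need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def split_known_files(root, known_hashes):
    """Walk a directory tree and separate files whose hash is already known.

    known_hashes is a hypothetical set of hex digests, for example hashes of
    stock operating system files that an investigator wants to set aside.
    """
    known, unknown = [], []
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            bucket = known if sha256_of(path) in known_hashes else unknown
            bucket.append(path)
    return known, unknown
```

Filtering out files with known hashes leaves a smaller set for an investigator to review; the same lookup can also flag files whose hashes appear on a list of interest.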

The tools are designed to be customizable through an open plugin architecture. Basis Technology helps manage a large and diverse community of developers who use the tools in investigations.

KonaSearch

In June 2019, BasisTech acquired KonaSearch,[22] a startup that specializes in search for Salesforce.com and other office database repositories and whose software can automate the search step of business workflows.[23]

References

  1. ^ "Base Linguistics".
  2. ^ "Name Indexer - Name Match".
  3. ^ "Entity Extractor - Entity Recognition".
  4. ^ "Elasticsearch Plugins - Elasticsearch Enrichment".
  5. ^ "Elasticsearch Plugins - Elasticsearch Enrichment".
  6. ^ "Babel Street Closes Highly Successful 2022 with Rosette Acquisition". www.businesswire.com. 2023-01-10. Retrieved 2024-04-11.
  7. ^ "Custom Solutions for Digital Forensics".
  8. ^ "About".
  9. ^ "Base Linguistics".
  10. ^ "Rosette Text Analytics".
  11. ^ "Uphold".
  12. ^ "Société Générale".
  13. ^ "Sensika".
  14. ^ "A Game-Changing Threat Intelligence Platform".
  15. ^ "Understand, Measure, and Act on Consumer Feedback".
  16. ^ Erard, Michael (March 1, 2004). "Translation in the Era of Terror". Technology Review.
  17. ^ Boyd, Clark (January 14, 2004). "Language tools for fight on terror". BBC News.
  18. ^ Weiss, Todd R. (March 10, 2003). "Language analysis software aids U.S. Web search for terrorist activity". Computerworld.
  19. ^ Profile in Boston Business Journal
  20. ^ Hollmer, Mark (March 21, 2003). "Basis Technology turns its focus to government security". Boston Business Journal.
  21. ^ Baker, Loren (November 30, 2004). "MSN Search Engine Uses Basis Technology for Natural Language Processing". Search Engine Journal.
  22. ^ "Basis Technology Brings Deep Search to Salesforce".
  23. ^ "About Us".

External links

  • Official website
  • Rosette website
  • Cyber Triage website
  • Autopsy digital forensics website
  • KonaSearch website
