Menu

Zifsoft releases Salsa Dragon (Planet ver 30)

Family Salsa Planet
Full project Name Platform version 30
Nickname: Salsa Dragon
Purpose of development:
  • Crawl the records in the database and check if if the url and email are still valid.
Description:
  • Crawling can be done by selecting a range of records and click crawlerUPgatt function. The Dragon crawler will check everything in the selected record range.
  • Dragon checks for similarities, if a record is the same yet the notes are different, it will merge the record.
  • If the record is dead, Dragon will “x” the url and em, i.e. the xurlem feature.
How to use:

  • Dragon is a collection of services in the Magneto and Selenium modules. There are a total of 18 Exposed features and 40 Hidden features in Dragon. The Hidden features can be added by customizing the Dbcln tool bar.
  • Dbcln (database cleaner) is where the exposed Dragon features are found.
  • Explanation of exposed features:
    • sortsht_biz sorts the db by business segment, country, state
    • xxxurlem is marks the record in the status field as “xxx” (closed), moves the url and email to the notes field with a date and “x” prefix indicating that they are no longer valid.
    • xxxurlemNclosetab closes the Chrome tab after Dragon gets all the data it needs from the internet.
    • mergecells merges records
    • homogenizautourl is a feature that analyzes similar records and fixes irregularities such misspelling (co names, addresses) the search criteria is based on URL.
    • homogenizeindiv is a feature that deals with B2C customers by placing them in the correct business segment.
    • getnmurlcofromemail = get contact name url company name from email address.
    • blockselectsimilarupdown = select similar records. You can point anywhere on the record. This feature will collect all similar records.
    • acqemchrome4cellclosetab = acquire email from chrome for selected cell, then close tab when done
    • acqemchrome4cellgatt2nclosetab is the same acqemchrome4cellclosetab but collection or email is done twice
    • acqemchrome4cellgatt3nclosetab is the same acqemchrome4cellclosetab but collection or email is done thrice
    •  checkurlsim checks records with similar urls for inconsistencies and fixes the inconsistencies. xurlem if the url is broken and email not valid.
    • checkurlcrawlerUP is checkurlsim but crawling up the records.
    • checkurlcrawlergatt is checkurlcrawlerUP but in the range. It includes grabbing data from Chrome and cleaning up after its data grab.
    • checkurlcrawlerget1mailNcrawlup checks URL, get mail from chrome once, crawl up
    • checkurlcrawlerget2mailNcrawlup checks URL, get mail from chrome twice, crawl up
    • checkurlcrawlerget3mailNcrawlup checks URL, get mail from chrome thrice, crawl up
Accessing:
  •  The main Salsa engine must uninstalled. You can do this by clicking on the path below.
Installation .exe: //shareddrive/SalsaPlanet/Platform/v30/Installer/install.bat

Release history:

Salsa Dragon is version 30 of the Salsa Engine. This version introduces an updated version of a crawler called Homogenize Crawler developed in 2016.  The new release takes advantage of the new cleansing features from Magneto 2.0
Salsa Caye is version 29 of the Salsa Engine. It fixes 420 bugs.
Salsa Boon is version 28 of the Salsa Engine. It introduces new search and navigation methods.

Salsa Arary is version 27 of the Salsa Engine. It opens the data types within the data field of each record allowing more diverse data.
Salsa Zu (ver Z) is the latest release. This release addresses the many config files in each App. It also paves the way for config abstraction layer.

Salsa Yoi (ver Y) addresses the need to include App fields into the config file.

Salsa Ximena (ver X) developed for the Aggressor App to rapidly move data in and out of the database.

Salsa Wakanda (ver W) is an experiment running Salsa Platform on a small network over several PCs with each PC running it’s own database. As in Marvel’s Wakanda, the Wakanda is an encryption decryption mechanism that allows authroize users to share data.

Salsa Vakona (ver V) is module built for the Scheduler App. It batch collects reference contact & emails from the resume reposity and matches the information to the candidate making it simpler for the recruiter to do background checks.

Salsa Ulyses (ver U) is an explorer program that collects local URL and derives new contacts from the url address.

Salsa Telerium (ver T) is a automatic email miner program. It collects telemetrics of a specific site and determines if it needs mine a website deeper. The telemetry it weighs on a webpage are contact richness, content richness , length of a page.

Salsa Solenoid (ver S) is an experimental module for using Salsa facilities over small network such as distributed computing, rapidly message sending.

Salsa Ran (ver R) is a thematic explorer. Type in a theme, e,g. “Singapore tuas factory”. It uses the theme to search for contacts or leads. When it’s done with it’s collection, a statistical report on the market size and market segment is generated.

Salsa Qiu (ver Q) searches a set of data from clipboard or Canvass for emails and URLs.

Salsa Pinius (ver P) is a semi automatic contact scrapping program.

Salsa Ocean (ver O) a third version of Ocean addressing field nomenclature and data type allowed in Salsa DB.

Salsa Netnymph (ver N) is a url explorer. From a single URL it’s able to map how the urls are connected.

Salsa Selenium (ver S) is a proprietary version of the Chrome Selenium. Salsa Selenium overcomes many of the problems in Chrome Selenium where the hooks are replaced with multiple source telemetry. Salsa allows importing data from the internet through a browser without any interface.

Salsa Magneto (ver M) is a collection of modules that cleanses the database.

Salsa driLL miner (ver L) is a database miner where it finds associated urls.

Salsa Kanvass (ver K) is opens up examining an imported webpage by separating and cataloging all the elements.

Salsa Gattlinger (ver G) is built for the Scheduler App. It is a fully automatic batch file importer and content analyzer. The file can of any format: pdf, xls, doc, docx, txt, xml. Gattglinger works a Salsa’s proprietary context-based-OCR to read, clean and covert data into text that is imported into the Salsa database.

Salsa versions C,D,E,F were scrapped. The idea to accelerate the development of Salsa using, open software turned out to be a dud. It took a lot of time to understand the limitations of the open software and the complexity to integrate these software into Salsa.

Salsa Battery (ver B) is a module that studies URL and how subsequent URLs from the Master page are constructed.

Salsa A (ver A) built to streamline sales information. The system was initially called 3S.

Leave a Reply

Your email address will not be published. Required fields are marked *

Protected with IP Blacklist CloudIP Blacklist Cloud

Lifvation Group