Flightplan
Flightplan is a Javascript library that makes it easy to scrape and parse airline websites for award inventory. It uses Puppeteer for scraping, which is built on top of headless Chrome, meaning it behaves just like a full-fledged Chrome browser, but it can be run from the command line with no visible window (allowing you to use your computer for other things). Furthermore, it can run on any platform supported by headless Chrome, which is just about everything (Windows, Mac, or Linux).
Why?
If you're sitting on a pile of airline miles or credit card points, you know that redeeming them can be difficult. Often, planning for my own trips, I would spend hours clicking through an airline's website, searching for available awards, while writing down what I found in a notebook. Eventually, I decided to automate that process, so I could free up my time. Flightplan doesn't scrape much faster than a human would, it simply will do it for hours on end without complaining or making mistakes. This can make planning complex award itineraries much less stressful!
Disclaimer: Scraping is generally against an airline's website's terms of service. As mentioned above, Flightplan typically doesn't place more load on a website than a normal human would, but unlike the human, it can run 24/7. So please use responsibly! Use of any scraping tool (or even excessive non-automated usage) can cause an airline to temporarily (or permanently) ban your IP or member account.
Supported Airlines
Airline | Website | Search | Parse |
---|
AC (Air Canada) | Aeroplan | :construction: | :construction: |
BA (British Airways) | Executive Club | :white_check_mark: | :construction: |
CX (Cathay Pacific) | AsiaMiles | :white_check_mark: | :white_check_mark: |
KE (Korean Air) | SKYPASS | :white_check_mark: | :white_check_mark: |
NH (All Nippon Airways) | ANA Mileage Club | :white_check_mark: | :white_check_mark: |
QF (Qantas) | Frequent Flyer | :construction: | :construction: |
SQ (Singapore Airlines) | KrisFlyer | :white_check_mark: | :white_check_mark: |
Getting Started
To use Flightplan, there are a few prerequisites that must be installed:
- Node.js 8.x or later (Installation instructions: (Windows | Mac | Linux)
- Yarn (Installation instructions)
- Install the proper build tools for your platform:
- Windows: Open an elevated PowerShell prompt (in taskbar search, type
powershell
, right-click on "Windows PowerShell" and select "Run as Administrator"). Then run npm install --add-python-to-path --global --production windows-build-tools
. Once the install is complete, you must restart your computer. - MacOS: Xcode will have already been installed if you followed the linked instructions to install
node
in Step 1 above. - Ubuntu:
sudo apt-get install build-essential
If you simply wish to install and run the Flightplan tools, use:
yarn global add flightplan-tool
mkdir flightplan
cd flightplan
flightplan
If you're a developer, you can install Flightplan to an existing Javascript project with:
yarn add flightplan-tool
Note: When you install Flightplan, it will be bundled with a recent version of Chromium automatically (so you do not need Chrome installed on your machine to use Flightplan).
Running the tools
For developers who wish to use Flightplan in their own Javascript projects, skip to the Library Usage section below. For everyone else, you're in the right place! If you'd like a video introduction to using Flightplan, check out the tutorial on YouTube:
http://www.youtube.com/watch?feature=player_embedded&v=QMtiucIPOxs
mkdir flightplan
cd flightplan
flightplan
flightplan search
The first time you run the search
command, you will be prompted to create a config/accounts.json
file. You must enter valid credentials in this file, for any engine you will be using, or login will fail.
After you've run a search, you should see the raw HTML and screenshots in the data
subdirectory. To extract award fares from the HTML and populate the database, run flightplan parse
.
When you are ready to run the React web UI, run both flightplan server
and flightplan client
. When the React app has finished loading, a browser will automatically be opened, pointing to http://localhost:3000/
.
Library usage
With Flightplan, you specify an airline and get back an engine, which supports two operations: searching and parsing.
- Searching simply takes a query, and fetches the HTML response (or multiple responses for some websites, which break the results up across multiple tabs).
- Parsing takes those HTML responses, and returns the list of flight awards. (Note: currently, Flightplan ignores non-direct routes and partner awards, this may change in the future.)
This is useful, because searching is expensive, but parsing is cheap. So it makes sense to search once, but be able to parse many times (perhaps due to bug fixes or new features being added).
const fp = require('flightplan');
const cx = fp.new('cx');
(async () => {
await cx.initialize({ credentials: ['1234567890', 'passw0rd'] });
const { responses, error } = await cx.search({
fromCity: 'HKG', toCity: 'LHR',
departDate: '2019-03-06', cabin: 'first'
});
if (error) {
console.log(error);
return;
}
const { awards } = cx.parse(responses);
console.log(awards);
})();
You can also instruct the search engine to save both the HTML output, and even screenshots! :tada: This makes debugging what might've gone wrong later much easier. Let's try it out:
const fp = require('flightplan');
const sq = fp.new('sq');
(async () => {
await sq.initialize({ username: '1234567890', password: '123456' });
const { htmlFiles, screenshots, fileCount, error } = await sq.search({
fromCity: 'SIN', toCity: 'HKG',
departDate: '2019-03-06', cabin: 'business',
htmlFile: 'output.html', screenshot: 'output.jpg'
});
if (!error) {
console.log('Files Saved:', fileCount);
console.log('HTML:', htmlFiles);
console.log('Screenshots:', screenshots);
}
})();
More API details to come later...