Add Spark Connect page

This PR adds a Spark Connect page.

Sorry for the huge PR.  This PR adds Spark Connect to the dropdown which apparently modifies a bunch of HTML files.  Let me know if this is OK!

<img width="501" alt="Screenshot 2024-04-02 at 4 34 17 PM" src="https://github.com/apache/spark-website/assets/2722395/e1279758-e216-4123-acbc-1b244d9c63f6">

Author: Matthew Powers <matthewkevinpowers@gmail.com>

Closes #511 from MrPowers/add-spark-connect.
245 files changed
tree: 0a4dc2d4445d45bd0bb2bef466436debe55231f8
  1. .bundle/
  2. .github/
  3. _layouts/
  4. css/
  5. graphx/
  6. images/
  7. js/
  8. mllib/
  9. news/
  10. releases/
  11. screencasts/
  12. site/
  13. spark-connect/
  14. sql/
  15. streaming/
  16. talks/
  17. .asf.yaml
  18. .gitignore
  19. _config.yml
  20. apple-touch-icon.png
  21. committers.md
  22. community.md
  23. contributing.md
  24. developer-tools.md
  25. doap.rdf
  26. documentation.md
  27. downloads.md
  28. error-message-guidelines.md
  29. examples.md
  30. faq.md
  31. favicon-16x16.png
  32. favicon-32x32.png
  33. favicon.ico
  34. Gemfile
  35. Gemfile.lock
  36. history.md
  37. improvement-proposals.md
  38. index.md
  39. LICENSE
  40. mailing-lists.md
  41. merge_pr.py
  42. powered-by.md
  43. README.md
  44. release-process.md
  45. research.md
  46. robots.txt
  47. security.md
  48. sitemap.xml
  49. third-party-projects.md
  50. trademarks.md
  51. versioning-policy.md
README.md

Generating the website HTML

In this directory you will find text files formatted using Markdown, with an .md suffix.

Building the site requires Jekyll Rouge. The easiest way to install the right version of these tools is using Bundler and running bundle install in this directory.

See also https://github.com/apache/spark/blob/master/docs/README.md

A site build will update the directories and files in the site directory with the generated files. Using Jekyll via bundle exec jekyll locks it to the right version. So after this you can generate the html website by running bundle exec jekyll build in this directory. Use the --watch flag to have jekyll recompile your files as you save changes.

In addition to generating the site as HTML from the Markdown files, jekyll can serve the site via a web server. To build the site and run a web server use the command bundle exec jekyll serve which runs the web server on port 4000, then visit the site at http://localhost:4000.

Please make sure you always run bundle exec jekyll build after testing your changes with bundle exec jekyll serve, otherwise you end up with broken links in a few places.

Updating Jekyll version

To update Jekyll or any other gem please follow these steps:

  1. Update the version in the Gemfile
  2. Run bundle update which updates the Gemfile.lock
  3. Commit both files

Docs sub-dir

The docs are not generated as part of the website. They are built separately for each release of Spark from the Spark source repository and then copied to the website under the docs directory. See the instructions for building those in the readme in the Spark project's /docs directory.

Rouge and Pygments

We also use Rouge for syntax highlighting in documentation Markdown pages. Its HTML output is compatible with CSS files designed for Pygments.

To mark a block of code in your Markdown to be syntax highlighted by jekyll during the compile phase, use the following syntax:

{% highlight scala %}
// Your Scala code goes here, you can replace Scala with many other
// supported languages too.
{% endhighlight %}

You probably don't need to install that unless you want to regenerate the Pygments CSS file. It requires Python, and can be installed by running sudo easy_install Pygments.

Merge PR

To merge pull request, use the merge_pr.py script which also squashes the commits.