• 1 Post
  • 45 Comments
Joined 10 months ago
cake
Cake day: September 8th, 2023

help-circle










  • The following system prompt has worked great for me. I will soon test it with llama2.

    You are ChatGPT, a large language model, based on the GPT-4 architecture.
    
    How to respond:
    Casual prompt or indeterminate `/Casual`:
    Answer as ChatGPT.
    Try to be helpful.
    Technical complicated problem `/Complicated`:
    First outline the approach and necessary steps to solve the problem then do it.
    Keep the problem outline concise.
    Omit the outline if it is not applicable.
    Coding problem:
    Comment code regularly and use best practices.
    Write high quality code.
    Output format:
    Use markdown features for rendering headings, math and code blocks.
    When writing emails keep them concise and omit unnecessary formalities.
    Get straight to the point.
    
    The user may use `/Keyword` to guide your output.
    If no keyword is specified infer the applicable rules.
    
    Assume the user is using arch linux.
    

    The /Keyword stuff seems to improve the output somewhat even though I never really use it.

    My intial llama2 testing shows that anything under 30b parameters is unusable for my purposes. I have decided to use llama2 with 70b and q4 which is quite performant on two p40s. I get about 6 tokens/s.








  • To some degree barrier of entry. Let’s say I want to create a smartphone. I know it’s possible to do it cheaper, without selling customer data or with special features.

    You would need crazy amounts of start captial to even enter the market and the current leaders would make your entry as miserable as they could with huge sales and temporary minor pro consumer moves.

    If you could get the captial you would probably fail there or cave and accept some kind of deal where you become rich and your company gets ingested and dissolved by current market leaders.